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TWO HUMAN NSP-LTKE PROTEINS 

The present invention relates to nucleic acid and amino acid 
sequences of two novel human NSP-like proteins and to the use of these 
sequences in the diagnosis, study, prevention and treatment of disease. 

BACKGROUND ART 

Neuroendocrine-specific proteins (NSP-A, NSP-B, and NSP-C) are a 
recently characterized group of membrane-anchored endoplasmic reticulum 
(ER) proteins that share identical carboxy-terminal amino acid sequences 
{van de Velde HJ et al (1994) J Cell Sci 107:2403-2416). Evidence 
suggests that NSP-A and NSP-C expression is restricted to neuronal and 
endocrine cell populations (van de Velde, supra) . Immunohistochemical 
studies showed that rat NSP-A is expressed throughout the rat brain (van 
de Velde HJ et al (1994) Mol Brain Res 23:81-92). NSP-B, however, is 
found only in a small cell lung carcinoma cell line and probably 
represents an aberrant NSP gene product (Roebroek AJ et al (1993) J Biol 
Chem 268:13439-13447). A previously reported neuronally expressed rat 
gene, CI-13, and two partially sequenced human cDNAs (GI 391043 and GI 
894 620), have a high degree of homology to NSPs which suggests that NSPs 
belong to a larger family of proteins (Wieczorek DF et al (1991) Mol 
Brain Res 10:33-41; Bell GI et al (1993) Hum Mol Genet 2:1793-798; 
Martin-Galla A et al (1992) Nat Genet 1:34-39). 

Two large hydrophobic regions characterize the NSPs and homologous 
proteins and suggest membrane association. In fact, immunofluorescence 
and biochemical studies have established an association between NSPs and 
membranes of the ER (Senden NH et al (1994) Eur J Cell Biol 65:341-353). 
Analysis of NSP-A deletion mutants indicates that the carboxy-terminal 
hydrophobic region is necessary for membrane binding (van de Velde et al, 
supra) . Carboxy-terminal amino acid sequences of the NSPs are highly 
homologous, although they are not a perfect match to a consensus motif 
sufficient for retention of transmembrane proteins in the ER (van de 
Velde, supra; Jackson MR et al (1993) J Cell Biol 121:317-333). Thus, it 
appears likely that NSPs and related proteins are targeted to the ER by 
conserved carboxy-terminal amino acids. 

Immunostaining with anti-NSP-A antibodies suggests that NSP-A may 
be associated with both the rough and smooth neuronal ER. On the basis 
of this evidence and knowledge of neuronal ER function, van de Velde et 
al (1994; supra) conclude that NSPs may be involved in the protein 
transport process or in the regulation of intracellular calcium levels in 
neuronal cells* 
HSP-like Proteins and DiaaMo 

Dysfunction of ER-mediated neuronal protein transport may 
contribute to neurodegenerative diseases. For example, in amyotrophic 
lateral sclerosis (ALS) , a degenerative disease of motor neurons, 
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position of neurofilaments in neuronal axons leads to dramatic defects in 
ER-mediated axonal transport of a variety of proteins (Collard JF et al 
(1995) Nature 375:61-64). Defects in protein transport have been further 
implicated in the pathogenesis of ALS by a transgenic mouse study in 
which ALS is modeled by a mutation in superoxide dismutase (SOD) . SOD 
mutant animals displayed clinical and pathological features of human ALS 
and showed axonal transport defects associated with dilation of the ER 
(Mourelatos Z et al (1996) Proc Natl Acad Sci 93:5472-5477). 

Analysis of specimens of a wide variety of primary human tumors 
show that NSP-A and NSP-C are expressed in small cell lung carcinoma, 
carcinoid tumors of the lung, but not in non-neuroendocrine non-small 
cell lung carcinomas (van de Velde et al (1994) Cancer Res 54:4769-4776). 
Furthermore, antibodies generated to small-cell lung carcinoma surface 
antigens recognize NSP-A, NSP-B, and NSP-C. Therefore, NSPs may act as 
markers in human lung cancer diagnosis and provide an avenue for 
corrective treatment (Senden NH et al (1994) Int J Cancer Suppl 8:84-88). 

New NSP-like proteins could satisfy a need in the art by providing 
new means of diagnosing and treating cancer and neurodegenerative 
disorders such as ALS. 

DISCLOSURE OF THE INVENTION 

The present invention discloses two novel human NSP-like proteins 
(hereinafter referred to individually as NSPLPA and NSPLPB, and 
collectively as NSPLP) , characterized as having homology to human NSP-A 
(GI 307307), NSP-B (GI 307309), NSP-C (GI 307311), and rat CI-13 (GI 
28104 6) . Accordingly, the invention features two substantially purified 
NSP-like proteins, as shown in amino acid sequence of SEQ ID NO:l and SEQ 
ID NO: 3, and having characteristics of NSPs. 

One aspect of the invention features isolated and substantially 
purified polynucleotides which encode NSPLP. In a particular aspect, the 
polynucleotide is the nucleotide sequence of SEQ ID NO:2 or SEQ ID NO: 4. 
In addition, the invention features polynucleotide sequences that 
hybridize under stringent conditions to SEQ ID N0:2 or SEQ ID NO:4. 

The invention further relates to nucleic acid sequences encoding 
NSPLP, oligonucleotides, peptide nucleic acids (PNA) , fragments, portions 
or antisense molecules thereof, and expression vectors and host cells 
comprising polynucleotides which encode NSPLP. The present invention 
also relates to antibodies which bind specifically to NSPLP, 
pharmaceutical compositions comprising substantially purified NSPLP, 
fragments thereof, or antagonists of NSPLP, in conjunction with a 
suitable pharmaceutical carrier, and methods for producing NSPLP, 
fragments thereof, or antagonists of NSPLP. 

BRIEF DESCRIPTION OF DRAWINGS 
Figures 1A, IB and 1C show the amino acid sequence (SEQ ID NO:l) 
and nucleic acid sequence (SEQ ID NO: 2) of the novel NSP-like protein, 
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NSPLPA. The alignment was produced using MacDNAsis software (Hitachi 
Software Engineering Co Ltd) . 

Figures 2A, 2B and 2C show the amino acid sequence (SEQ ID NO: 3) 
and nucleic acid sequence (SEQ ID NO: 4) of the novel NSP-like protein, 
NSPLPB (MacDNAsis software, Hitachi Software Engineering Co Ltd) . 

Figures 3A, 3B, 3C, 3D and 3E show the northern analysis for the 
consensus sequence (SEQ ID NO:4). The northern analysis was produced 
electronically using LIFESEQ** database (Incyte Pharmaceuticals, Palo Alto 
CA) . 

Figures 4 A, 4B and 4C show the northern analysis for Incyte Clones 
31870 (SEQ ID NO: 2) (LIFESEQ m database, Incyte Pharmaceuticals, Palo Alto 
CA). 

Figure 5 shows the assembly for the consensus sequence (SEQ ID 

NO:2) . 

Figures 6A, 6B, 6C, 6D, 6E and 6F show the amino acid sequence 
alignments among NSPLPA (SEQ ID NO:l), NSPLPB (SEQ ID NO:3), NSP-A (GI 
307307; SEQ ID NO: 5), NSP-B (GI 307309; SEQ ID NO: 6), NSP-C (GI 307311); 
SEQ ID N0:7), and rat CI-13 (GI 281046 SEQ ID NO:8) produced using the 
multisequence alignment program of DNAStar software (DNAStar Inc, Madison 
WI) . 

Figure 7 shows the hydrophobicity plot (generated using MacDNAsis 
software) for NSPLPA, SEQ ID NO:l; the X axis reflects amino acid 
position, and the negative Y axis, hydrophobicity (Figs. 7, 8, and 9) . 

Figure 8 shows the hydrophobicity plot for NSPLPB, SEQ ID NO: 3. 

Figure 9 shows the hydrophobicity plot for NSP-C, SEQ ID NO: 7. 
MODES FOR CARRYING OUT THE INVENTION 

Definitions 

"Nucleic acid sequence" as used herein refers to an 
oligonucleotide, nucleotide or polynucleotide, and fragments or portions 
thereof, and to DNA or RNA of genomic or synthetic origin which may be 
single- or double-stranded, and represent the sense or antisense strand. 
Similarly, amino acid sequence as used herein refers to peptide or 
protein sequence. 

** Peptide nucleic acid" as used herein refers to a molecule which 
comprises an oligomer to which an amino acid residue, such as lysine, and 
an amino group have been added. These small molecules, also designated 
anti-gene agents, stop transcript elongation by binding to their 
complementary (template) strand of nucleic acid (Nielsen PE et al (1993) 
Anticancer Drug Des 8:53-63). 

As used herein, NSPLP refers to the amino acid sequences of 
substantially purified NSPLP obtained from any species, particularly 
mammalian, including bovine, ovine, porcine, murine, equine, and 
preferably human, from any source whether natural, synthetic, 
semi-synthetic or recombinant. 
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A "variant" of NSPLP is defined as an amino acid sequence that is 
altered by one or more amino acids. The variant may have "conservative" 
changes, wherein a substituted amino acid has similar structural or 
chemical properties, eg, replacement of leucine with isoleucine. More 
rarely, a variant may have "nonconservative" changes, eg, replacement of 
a glycine with a tryptophan. Similar minor variations may also include 
amino acid deletions or insertions, or both. Guidance in determining 
which and how many amino acid residues may be substituted, inserted or 
deleted without abolishing biological or immunological activity may be 
found using computer programs well known in the art, for example, DNAStar 
software. 

A "deletion" is defined as a change in either amino acid or 
nucleotide sequence in which one or more amino acid or nucleotide 
residues, respectively, are absent. 

An "insertion" or "addition" is that change in an amino acid or 
nucleotide sequence which has resulted in the addition of one or more 
amino acid or nucleotide residues, respectively, as compared to the 
naturally occurring NSPLP. 

A "substitution" results from the replacement of one or more amino 
acids or nucleotides by different amino acids or nucleotides, 
respectively. 

The term "biologically active" refers to a NSPLP having structural, 
regulatory or biochemical functions of a naturally occurring NSPLP. 
Likewise, " immunologically active" defines the capability of the natural, 
recombinant or synthetic NSPLP, or any oligopeptide thereof, to induce a 
specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

The term "derivative" as used herein refers to the chemical 
modification of a nucleic acid encoding NSPLP or the encoded NSPLP. 
Illustrative of such modifications would be replacement of hydrogen by an 
alkyl, acyl, or amino group. A nucleic acid derivative would encode a 
polypeptide which retains essential biological characteristics of natural 
NSPLP. 

As used herein, the term ^substantially purified" refers to 
molecules, either nucleic or amino acid sequences, that are removed from 
their natural environment, isolated or separated, and are at least 60% 
free, preferably 75% free, and most preferably 90% free from other 
components with which they are naturally associated. 

"Stringency" typically occurs in a range from about Tm-5°C (5°C 
below the Tm of the probe) to about 20°C to 25°C below Tm. As will be 
understood by those of skill in the art, a stringency hybridization can 
be used to identify or detect identical polynucleotide sequences or to 
identify or detect similar or related polynucleotide sequences. 

The term "hybridization" as used herein shall include "any process 
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by which a strand of nucleic acid joins with a complementary strand 
through base pairing" (Coombs J (1994) Dictionary c£ Biotechnology . 
Stockton Press, New York NY) . Amplification as carried out in the 
polymerase chain reaction technologies is described in Dieffenbach CW and 
GS Dveksler (1995, PCR Primer , a Laboratory Manual . Cold Spring Harbor 
Press, Plainview NY) . 

The present invention relates to novel NSPLP and to the use of the 
nucleic acid and amino acid sequences in the study, diagnosis, prevention 
and treatment of disease. cDNAs encoding a portion of NSPLP were found 
in neuronal and endocrine tissue-derived cDNA libraries and in a variety 
of other tissues , including many types of tumors (Figs. 3A-3E and 4A-4C) . 

The present invention also encompasses NSPLP variants. A preferred 
NSPLP variant is one having at least 80% amino acid sequence similarity 
to the NSPLP amino acid sequence (SEQ ID NO:l), a more preferred NSPLP 
variant is one having at least 90% amino acid sequence similarity to SEQ 
ID NO:l and a most preferred NSPLP variant is one having at least 95% 
amino acid sequence similarity to SEQ ID NO:l. 

Nucleic acids encoding the human NSPLP of the present invention 
were first identified in cDNA, Incyte Clones 31870 (SEQ ID NO: 4; THP-1 
cell cDMA library, THP1NOB01) and 28742 (SEQ ID NO:9; fetal spleen cDNA 
library, SPLNFET01), through a computer-generated search for amino acid 
sequence alignments. A consensus sequence, SEQ ID NO: 2, was derived from 
the following overlapping nucleic acid sequences: Incyte Clones 28742 
(from cDNA library SPLNFET01) ; 45022, 45074, and 45509 (CORNNOT01) ; 
121581 (MUSCNOT01); 570122 (MMLR3DT01) ; and 754150 (BRATUT02; Fig. 5). 
The nucleic acid sequence of SEQ ID NO: 2 encodes the NSPLPA amino acid 
sequence, SEQ ID N0:1. The nucleic acid sequence of SEQ ID NO: 4 encodes 
the NSPLPB amino acid sequence, SEQ ID NO: 3. The nucleic acid sequence 
of SEQ ID NO: 4 from residue C„< to T TM has 97% identity to the partial 
cDNA sequence of clone hbc043 (GI 39104; Bell et al, supra) . 

The present invention is based, in part, on the chemical and 
structural homology among NSPLPA, NSPLPB, NSP-A (GI 307307; Roebroek et 
al, supra), NSP-B (GI 307309; Roebroek et al, supra), NSP-C (GI 307311; 
Roebroek et al, supra), and rat CI-13 (GI 281046; Wieczorek et al, supra; 
Figs. 6A-D) . NSPLPA and NSP-C share 66% identity, NSPLPB and NSP-C share 
48% identity, while NSPLPA and NSPLPB share 50% identity. As illustrated 
by Figures 7, 8, and 9, NSPLPA, NSPLPB, and NSP-C have similar 
hydrophobicity plots suggesting similar structure. Like the NSPs, NSPLPA 
and NSPLPB have two large hydrophobic regions that could be used for 
membrane attachment. The carboxy- terminal amino acids Lys m through Lys 197 
of NSPLPA precisely match, in position as well as sequence, an ER 
retention motif defined by Jackson et al (1993; supra). The novel NSPLPA 
is 199 amino acids long and has one potential N glycosylation site. The 
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novel NSPLPB is 241 amino acids long. 
Thft HSPLP Coding 9aquoncQa 

The nucleic acid and deduced amino acid sequences of NSPLP are 
shown in Figures 1A, IB, 1C, 2A, 2B and 2C. In accordance with the 
invention, any nucleic acid sequence which encodes the amino acid 
sequence of NSPLP can be used to generate recombinant molecules which 
express NSPLP. In a specific embodiment described herein, a nucleotide 
sequence encoding a portion of NSPLP was first isolated as Incyte Clones 
31B70 from a THP-1 cell cDNA library (THP1NOB01) . While, Incyte Clone 
2B742 was first isolated from a fetal spleen cDNA library (SPLNFET01). 

It will be appreciated by those skilled in the art that as a result 
of the degeneracy of the genetic code, a multitude of NSPLP- encoding 
nucleotide sequences, some bearing minimal homology to the nucleotide 
sequences of any known and naturally occurring gene may be produced. The 
invention contemplates each and every possible variation of nucleotide 
sequence that could be made by selecting combinations based on possible 
codon choices. These combinations are made in accordance with the 
standard triplet genetic code as applied to the nucleotide sequence of 
naturally occurring NSPLP, and all such variations are to be considered 
as being specifically disclosed. 

Although nucleotide sequences which encode NSPLP and its variants 
are preferably capable of hybridizing to the nucleotide sequence of the 
naturally occurring NSPLP under appropriately selected conditions of 
stringency, it may be advantageous to produce nucleotide sequences 
encoding NSPLP or its derivatives possessing a substantially different 
codon usage. Codons may be selected to increase the rate at which 
expression of the peptide occurs in a particular prokaryotic or 
eukaryotic expression host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for 
substantially altering the nucleotide sequence encoding NSPLP and its 
derivatives without altering the encoded amino acid sequences include the 
production of RNA transcripts having more desirable properties, such as a 
greater half -life, than transcripts produced from the naturally occurring 
sequence . 

It is now possible to produce a DNA sequence, or portions thereof, 
encoding a NSPLP and its derivatives entirely by synthetic chemistry, 
after which the synthetic gene may be inserted into any of the many 
available DNA vectors and cell systems using reagents that are well known 
in the art at the time of the filing of this application. Moreover, 
synthetic chemistry may be used to introduce mutations into a sequence 
encoding NSPLP or any portion thereof. 

Also included within the scope of the present invention are 
polynucleotide sequences that are capable of hybridizing to the 
nucleotide sequences of figures 1A, IB, 1C, 2A, 2B, and 2C under various 
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conditions of stringency. Hybridization conditions are based on the 
melting temperature (Tm) of the nucleic acid binding complex or probe, as 
taught in Berger and Kimmel (1987, Guide to Molecular Cloning 
Techniques. Methods in Enzvmology. Vol 152, Academic Press, San Diego CA) 
incorporated herein by reference, and confer may be used at a defined 
stringency. 

Altered nucleic acid sequences encoding NSPLP which may be used in 
accordance with the invention include deletions, insertions or 
substitutions of different nucleotides resulting in a polynucleotide that 
encodes the same or a functionally equivalent NSPLP. The protein may 
also show deletions, insertions or substitutions of amino acid residues 
which produce a silent change and result in a functionally equivalent 
NSPLP. Deliberate amino acid substitutions may be made on the basis of 
similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity, and/or the amphipathic nature of the residues as long as 
the biological activity of NSPLP is retained. For example, negatively 
charged amino acids include aspartic acid and glutamic acid; positively 
charged amino acids include lysine and arginine; and amino acids with 
uncharged polar head groups having similar hydrophilicity values include 
leucine, isoleucine, valine; glycine, alanine; asparagine, glutamine; 
serine, threonine phenylalanine, and tyrosine. 

Included within the scope of the present invention are alleles of 
NSPLP. As used herein, an "allele" or "allelic sequence" is an 
alternative form of NSPLP. Alleles result from a mutation, ie, a change 
in the nucleic acid sequence, and generally produce altered inRNAs or 
polypeptides whose structure or function may or may not be altered. Any 
given gene may have none, one or many allelic forms. Common mutational 
changes which give rise to alleles are generally ascribed to natural 
deletions, additions or substitutions of amino acids. Each of these 
types of changes may occur alone, or in combination with the others, one 
or more times in a given sequence. 

Methods for DNA sequencing are well known in the art and employ 
such enzymes as the Klenow fragment of DNA polymerase I, Sequenase® (OS 
Biochemical Corp, Cleveland OH)), Taq polymerase (Perkin Elmer, Norwalk 
CT), thermostable T7 polymerase (Amersham, Chicago IL) , or combinations 
of recombinant polymerases and proofreading exonucleases such as the 
ELONGASE Amplification System marketed by Gibco BRL (Gaithersburg MD) . 
Preferably, the process is automated with machines such as the Hamilton 
Micro Lab 2200 (Hamilton, Reno NV) , Peltier Thermal Cycler (PTC200; MJ 
Research, Watertown MA) and the ABI 377 DNA sequencers (Perkin Elmer) . 
Extending the Polynucleotide Sequence 

The polynucleotide sequence encoding NSPLP may be extended 
utilizing partial nucleotide sequence and various methods known in the 
art to detect upstream sequences such as promoters and regulatory 
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elements. Gobinda et al (1993; PCR Methods Applic 2:318-22) disclose 
"restriction-site" polymerase chain reaction (PCR) as a direct method 
which uses universal primers to retrieve unknown sequence adjacent to a 
known locus. First, genomic DNA is amplified in the presence of primer 
to a linker sequence and a primer specific to the known region. The 
amplified sequences are subjected to a second round of PCR with the same 
linker primer and another specific primer internal to the first one. 
Products of each round of PCR are transcribed with an appropriate RNA 
polymerase and sequenced using reverse transcriptase. 

Inverse PCR can be used to amplify or extend sequences using 
divergent primers based on a known region (Triglia T et al (1988) Nucleic 
Acids Res 16:81B6) . The primers may be designed using OLIGO® 4.06 Primer 
Analysis Software (1992; National Biosciences Inc, Plymouth MN) , or 
another appropriate program, to be 22-30 nucleotides in length, to have a 
GC content of 50% or more, and to anneal to the target sequence at 
temperatures about 68°-72° C. The method uses several restriction 
enzymes to generate a suitable fragment in the known region of a gene. 
The fragment is then circularized by intramolecular ligation and used as 
a PCR template. 

Capture PCR (Lagerstrom M et al (1991) PCR Methods Applic 1:111-19) 
is a method for PCR amplification of DNA fragments adjacent to a known 
sequence in human and yeast artificial chromosome DNA. Capture PCR also 
requires multiple restriction enzyme digestions and ligations to place an 
engineered double-stranded sequence into an unknown portion of the DNA 
molecule before PCR. 

Another method which may be used to retrieve unknown sequences is 
that of Parker JD et al (1991; Nucleic Acids Res 19:3055-60). 
Additionally, one can use PCR, nested primers and PromoterFinder 
libraries to walk in genomic DNA ( Promote rFinder ,H Clontech (Palo Alto 
CA) . This process avoids the need to screen libraries and is useful in 
finding intron/exon junctions. 

Preferred libraries for screening for full length cDNAs are ones 
that have been size-selected to include larger cDNAs. Also, random 
primed libraries are preferred in that they will contain more sequences 
which contain the 5* and upstream regions of genes. A randomly primed 
library may be particularly useful if an oligo d(T) library does not 
yield a full-length cDNA. Genomic libraries are useful for extension 
into the 5' nontranslated regulatory region. 

Capillary electrophoresis may be used to analyze the size or 
confirm the nucleotide sequence of sequencing or PCR products. Systems 
for rapid sequencing are available from Perkin Elmer, Beckman Instruments 
(Fullerton CA) , and other companies. Capillary sequencing may employ 
flowable polymers for electrophoretic separation, four different 
fluorescent dyes (one for each nucleotide) which are laser activated, and 
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detection of the emitted wavelengths by a charge coupled devise camera. 
Output/light intensity is converted to electrical signal using 
appropriate software (eg. Genotyper w and Sequence Navigator* from Per kin 
Elmer) and the entire process from loading of samples to computer 
analysis and electronic data display is computer controlled. Capillary 
electrophoresis is particularly suited to the sequencing of small pieces 
of DNA which might be present in limited amounts in a particular sample. 
The reproducible sequencing of up to 350 bp of M13 phage DNA in 30 min 
has been reported (Ruiz -Martinez MC et al (1993) Anal Chem 65:2851-2858). 
Expression of the Nucleotide Samiftnctt 

In accordance with the present invention, polynucleotide sequences 
which encode NSPLP, fragments of the polypeptide, fusion proteins or 
functional equivalents thereof may be used in recombinant DNA molecules 
that direct the expression of NSPLP in appropriate host cells. Due to 
the inherent degeneracy of the genetic code, other DNA sequences which 
encode substantially the same or a functionally equivalent amino acid 
sequence, may be used to clone and express NSPLP. As will be understood 
by those of skill in the art, it may be advantageous to produce 
NSPLP-encoding nucleotide sequences possessing non-naturally occurring 
codons. Codons preferred by a particular prokaryotic or eukaryotic host 
(Murray E et al (1989) Nuc Acids Res 17:477-508) can be selected, for 
example, to increase the rate of NSPLP expression or to produce 
recombinant RNA transcripts having desirable properties, such as a longer 
half-life, than transcripts produced from naturally occurring sequence. 

The nucleotide sequences of the present invention can be engineered 
in order to alter a NSPLP coding sequence for a variety of reasons, 
including but not limited to, alterations which modify the cloning, 
processing and/or expression of the gene product. For example, mutations 
may be introduced using techniques which are well known in the art, eg, 
site-directed mutagenesis to insert new restriction sites, to alter 
glycosylation patterns, to change codon preference, to produce splice 
variants, etc. 

In another embodiment of the invention, a natural, modified or 
recombinant polynucleotides encoding NSPLP may be ligated to a 
heterologous sequence to encode a fusion protein. For example, for 
screening of peptide libraries for inhibitors of NSPLP activity, it may 
be useful to encode a chimeric NSPLP protein that is recognized by a 
commercially available antibody. A fusion protein may also be engineered 
to contain a cleavage site located between a NSPLP sequence and the 
heterologous protein sequence, so that the NSPLP may be cleaved and 
purified away from the heterologous moiety. 

In an alternate embodiment of the invention, the coding sequence of 
NSPLP may be synthesized, whole or in part, using chemical methods well 
known in the art (see Caruthers MH et al (1980) Nuc Acids Res Symp Ser 
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215-23, Horn T et al(19B0) Nuc Acids Res Symp Ser 225-32, etc). 
Alternatively, the protein itself could be produced using chemical 
methods to synthesize a NSPLP amino acid sequence, whole or in part. For 
example, peptide synthesis can be performed using various solid-phase 
techniques (Roberge JY et al (1995) Science 269:202-204) and automated 
synthesis may be achieved, for example, using the ABI 431A Peptide 
Synthesizer (Per kin Elmer) in accordance with the instructions provided 
by the manufacturer. 

The newly synthesized peptide can be substantially by preparative 
high performance liquid chromatography (eg, Creighton (19B3) Proteins . 
Structures dad Molecular Principles. WH Freeman and Co, New York NY) . 
The composition of the synthetic peptides may be confirmed by amino acid 
analysis or sequencing (eg, the Edman degradation procedure; Creighton, 
supra). Additionally the amino acid sequence of NSPLP, or any part 
thereof, may be altered during direct synthesis and/or combined using 
chemical methods with sequences from other proteins, or any part thereof, 
to produce a variant polypeptide. 

In order to express a biologically active NSPLP, the nucleotide 
sequence encoding NSPLP or its functional equivalent, is inserted into an 
appropriate expression vector, ie, a vector which contains the necessary 
elements for the transcription and translation of the inserted coding 
sequence . 

Methods which are well known to those skilled in the art can be 
used to construct expression vectors containing a NSPLP coding sequence 
and appropriate transcriptional or translational controls. These methods 
include An yi££2 recombinant DNA techniques, synthetic techniques and in 
Yjvo recombination or genetic recombination. Such techniques are 
described in Sambrook et al (1989) HplecuJar £loiiina, & Laboratory 
Manual . Cold Spring Harbor Press, Plainview NY and Ausubel FM et al 
(1989) Current PrPtQCQiS XXX Molecular Biology . John Wiley & Sons, New 
York NY. 

A variety of expression vector/host systems may be utilized to 
contain and express a NSPLP coding sequence. These include but are not 
limited to microorganisms such as bacteria transformed with recombinant 
bacteriophage, plasmid or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected 
with virus expression vectors (eg, baculovirus) ; plant cell systems 
transfected with virus expression vectors (eg, cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or transformed with bacterial expression 
vectors (eg, Ti or pBR322 plasmid); or animal cell systems. 

The "control elements" or "regulatory sequences" of these systems 
vary in their strength and specificities and are those nontranslated 
regions of the vector, enhancers, promoters, and 3' untranslated regions, 
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which interact with host cellular proteins to carry out transcription and 
translation. Depending on the vector system and host utilized, any 
number of suitable transcription and translation elements, including 
constitutive and inducible promoters, may be used. For example, when 
cloning in bacterial systems, inducible promoters such as the hybrid lacZ 
promoter of the Bluescript® phagemid (Stratagene, LaJolla CA) or pSportl 
(Gibco BRL) and ptrp-lac hybrids and the like may be used. The 
baculovirus polyhedrin promoter may be used in insect cells. Promoters 
or enhancers derived from the genomes of plant cells (eg, heat shock, 
ROBISCO; and storage protein genes) or from plant viruses (eg, viral 
promoters or leader sequences) may be cloned into the vector. In 
mammalian cell systems, promoters from the mammalian genes or from 
mammalian viruses are most appropriate. If it is necessary to generate a 
cell line that contains multiple copies of NSPLP, vectors based on SV40 
or EBV may be used with an appropriate selectable marker. 

In bacterial systems, a number of expression vectors may be 
selected depending upon the use intended for NSPLP. For example, when 
large quantities of NSPLP are needed for the induction of antibodies, 
vectors which direct high level expression of fusion proteins that are 
readily purified may be desirable. Such vectors include, but are not 
limited to, the multifunctional £. coli cloning and expression vectors 
such as Bluescript® (Stratagene) , in which the NSPLP coding sequence may 
be ligated into the vector in frame with sequences for the amino-terminal 
Met and the subsequent 7 residues of fl-galactosidase so that a hybrid 
protein is produced; pIN vectors (Van Heeke & Schuster (1989) J Biol Chem 
264:5503-5509); and the like. pGEX vectors (Promega, Madison WI) may 
also be used to express foreign polypeptides as fusion proteins with 
glutathione S-transf erase (GST). In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by adsorption to 
glutathione-agarose beads followed by elution in the presence of free 
glutathione. Proteins made in such systems are designed to include 
heparin, thrombin or factor XA protease cleavage sites so that the cloned 
polypeptide of interest can be released from the GST moiety at will. 

In the yeast, SacnharrnnvrA*; cerevisiae. a number of vectors 
containing constitutive or inducible promoters such as alpha factor, 
alcohol oxidase and PGH may be used. For reviews, see Ausubel et al 
(supra) and Grant et al (1987) Methods in Enzymology 153:516-544. 

In cases where plant expression vectors are used, the expression of 
a sequence encoding NSPLP may be driven by any of a number of promoters. 
For example, viral promoters such as the 35S and 19S promoters of CaMV 
(Brisson et al (1984) Nature 310:511-514) may be used alone or in 
combination with the omega leader sequence from TMV (Takamatsu et al 
(1987) EMBO J 6:307-311). Alternatively, plant promoters such as the 
small subunit of RUBISCO (Coruzzi et al (1984) EMBO J 3:1671-1680; 
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Broglie et al (1984) Science 224:838-843); or heat shock promoters 
(Winter J and Sinibaldi RM (1991) Results Probl Cell Differ 17:85-105) 
may be used. These constructs can be introduced into plant cells by 
direct DNA transformation or pathogen-mediated transfection. For reviews 
of such techniques, see Hobbs S or Murry LE in McGraw Hill Yearbook of 
Science aj^i Technology (1992) McGraw Hill New York NY, pp 191-196 or 
Weissbach and Weissbach (1988) Methods for Plant Molecular Biology , 
Academic Press, New York NY, pp 421-463. 

An alternative expression system which could be used to express 
NSPLP is an insect system. In one such system, Autoarapha californica 
nuclear polyhedrosis virus (AcNPV) is used as a vector to express foreign 
genes in Spodoptera frugiperda cells or in Trichoolusia larvae. The 
NSPLP coding sequence may be cloned into a nonessential region of the 
virus, such as the polyhedrin gene, and placed under control of the 
polyhedrin promoter. Successful insertion of NSPLP will render the 
polyhedrin gene inactive and produce recombinant virus lacking coat 
protein coat. The recombinant viruses are then used to infect 
fruaiperda cells or Trichoolnsia larvae in which NSPLP is expressed 
(Smith et al (1983) J Virol 46:584; Engelhard EK et al (1994) Proc Nat 
Acad Sci 91:3224-7) . 

In mammalian host cells, a number of viral-based expression systems 
may be utilized. In cases where an adenovirus is used as an expression 
vector, a NSPLP coding sequence may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and 
tripartite leader sequence. Insertion in a nonessential El or E3 region 
of the viral genome will result in a viable virus capable of expressing 
NSPLP in infected host cells (Logan and Shenk (1984) Proc Natl Acad Sci 
81:3655-59). In addition, transcription enhancers, such as the rous 
sarcoma virus (RSV) enhancer, may. be used to increase expression in 
mammalian host cells. 

Specific initiation signals may also be required for efficient 
translation of a NSPLP sequence. These signals include the ATG 
initiation codon and adjacent sequences. In cases where NSPLP, its 
initiation codon and upstream sequences are inserted into the appropriate 
expression vector, no additional translational control signals may be 
needed. However, in cases where only coding sequence, or a portion 
thereof, is inserted, exogenous transcriptional control signals including 
the ATG initiation codon must be provided. Furthermore, the initiation 
codon must be in the correct reading frame to ensure transcription of the 
entire insert. Exogenous transcriptional elements and initiation codons 
can be of various origins, both natural and synthetic. The efficiency of 
expression may be enhanced by the inclusion of enhancers appropriate to 
the cell system in use (Scharf D et al (1994) Results Probl Cell Differ 
20:125-62; Bittner et al (1987) Methods in Enzymol 153:516-544). 
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In addition, a host cell strain may be chosen for its ability to 
modulate the expression of the inserted sequences or to process the 
expressed protein in the desired fashion. Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation and acylation. 
Post-translational processing which cleaves a "prepro" form of the 
protein may also be important for correct insertion, folding and/or 
function. Different host cells such as CHO, HeLa, MDCK, 293, WI38, etc 
have specific cellular machinery and characteristic mechanisms for such 
post-translational activities and may be chosen to ensure the correct 
modification and processing of the introduced, foreign protein . 

For long-term, high-yield production of recombinant proteins, 
stable expression is preferred. For example, cell lines which stably 
express NSPLP may be transformed using expression vectors which contain 
viral origins of replication or endogenous expression elements and a 
selectable marker gene. Following the introduction of the vector, cells 
may be allowed to grow for 1-2 days in an enriched media before they are 
switched to selective media. The purpose of the selectable marker is to 
confer resistance to selection, and its presence allows growth and 
recovery of cells which successfully express the introduced sequences. 
Resistant clumps of stably transformed cells can be proliferated using 
tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex 
virus thymidine kinase (Wigler M et al (1977) Cell 11:223-32) and adenine 
phosphoribosyltransferase (Lowy I et al (1980) Cell 22:817-23) genes 
which can be employed in tk- or aprt- cells, respectively. Also, 
antimetabolite, antibiotic or herbicide resistance can be used as the 
basis for selection; for example, dhfr which confers resistance to 
methotrexate (Wigler M et al (1980) Proc Natl Acad Sci 77:3567-70); npt, 
which confers resistance to the aminoglycosides neomycin and G-418 
(Colbere-Garapin F et al (1981) J Mol Biol 150:1-14) and als or pat, 
which confer resistance to chlorsulfuron and phosphinotricin 
ace tyltransf erase, respectively (Murry, supra). Additional selectable 
genes have been described, for example, trpB, which allows cells to 
utilize indole in place of tryptophan, or hisD, which allows cells to 
utilize histinol in place of histidine (Hartman SC and RC Mulligan (1988) 
Proc Natl Acad Sci 85:8047-51). Recently, the use of visible markers has 
gained popularity with such markers as anthocyanins, B glucuronidase and 
its substrate, GUS, and luciferase and its substrate, luciferin, being 
widely used not only to identify transf ormants, but also to quantify the 
amount of transient or stable protein expression attributable to a 
specific vector system (Rhodes CA et al (1995) Methods Mol Biol 
55:121-131) . 
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If 4 ^ 4 ^ 4 ^*** 0 ** Qf Tranafornanta Containing fcha Polynucleotide Soautnce 
Although the presence /absence of marker gene expression suggests 
that the gene of interest is also present, its presence and expression 
should be confirmed. For example, if the NSPLP is inserted within a 
marker gene sequence, recombinant cells containing NSPLP can be 
identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a NSPLP sequence under the 
control of a single promoter. Expression of the marker gene in response 
to induction or selection usually indicates expression of the tandem 
NSPLP as well. 

Alternatively, host cells which contain the coding sequence for 
NSPLP and express NSPLP may be identified by a variety of procedures 
known to those of skill in the art. These procedures include, but are 
not limited to, DNA-DNA or DNA-RNA hybridization and protein bioassay or 
immunoassay techniques which include membrane, solution, or chip based 
technologies for the detection and/or quantification of the nucleic acid 
or protein. 

The presence of the polynucleotide sequence encoding NSPLP can be 
detected by DNA-DNA or DNA-RNA hybridization or amplification using 
probes, portions or fragments of polynucleotides encoding NSPLP. Nucleic 
acid amplification based assays involve the use of oligonucleotides or 
oligomers based on the NSPLP-encoding sequence to detect transf ormants 
containing DNA or RNA encoding NSPLP. As used herein "oligonucleotides" 
or "oligomers" refer to a nucleic acid sequence of at least about 10 
nucleotides and as many as about 60 nucleotides, preferably about 15 to 
30 nucleotides, and more preferably about 20-25 nucleotides which can be 
used as a probe or ampliraer. 

A variety of protocols for detecting and measuring the expression 
of NSPLP, using either polyclonal or monoclonal antibodies specific for 
the protein are known in the art. Examples include enzyme-linked 
immunosorbent assay (ELISA) , radioimmunoassay (RIA) and fluorescent 
activated cell sorting (FACS) . A two-site, monoclonal-based immunoassay 
utilizing monoclonal antibodies reactive to two non-interfering epitopes 
on NSPLP is preferred, but a competitive binding assay may be employed. 
These and other assays are described, among other places, in Hampton R et 
al (1990, Serological Methods. & Laboratory Manual, APS Press, St Paul 
MN) and Maddox DE et al (1983, J Exp Med 158:1211). 

A wide variety of labels and conjugation techniques are known by 
those skilled in the art and can be used in various nucleic acid and 
amino acid assays. Means for producing labeled hybridization or PCR 
probes for detecting sequences related to polynucleotides encoding NSPLP 
include oligolabeling, nick translation, end-labeling or PCR 
amplification using a labeled nucleotide. Alternatively, the NSPLP 
sequence, or any portion of it, may be cloned into a vector for the 
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production of an mRHA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro 
by addition of an appropriate RNA polymerase such as T7, T3 or SP6 and 
labeled nucleotides. 

A number of companies such as Pharmacia Biotech (Piscataway NJ) , 
Promega (Madison WI), and US Biochemical Corp (Cleveland OH) supply 
commercial kits and protocols for these procedures. Suitable reporter 
molecules or labels include those radionuclides, enzymes, fluorescent, 
chemi luminescent, or chromogenic agents as well as substrates, cof actors, 
inhibitors, magnetic particles and the like. Patents teaching the use of 
such labels include US Patents 3,817,837; 3,850,752; 3,939,350; 
3,996,345; 4,277,437; 4,275,149 and 4,366,241. Also, recombinant 
immunoglobulins may be produced as shown in US Patent No. 4,816,567 
incorporated herein by reference. 
Purification °f WPLP 

Host cells transformed with a nucleotide sequence encoding NSPLP 
may be cultured under conditions suitable for the expression and recovery 
of the encoded protein from cell culture. The protein produced by a 
recombinant cell may be secreted or contained intraceliularly depending 
on the sequence and/or the vector used. As will be understood by those 
of skill in the art, expression vectors containing polynucleotides 
encoding NSPLP can be designed with signal sequences which direct 
secretion of NSPLP through a prokaryotic or eukaryotic cell membrane. 
Other recombinant constructions may join NSPLP to nucleotide sequence 
encoding a polypeptide domain which will facilitate purification of 
soluble proteins (Kroll DJ et al (1993) DNA Cell Biol 12:441-53; cf 
discussion of vectors infra containing fusion proteins) . 

NSPLP may also be expressed as a recombinant protein with one or 
more additional polypeptide domains added to facilitate protein 
purification. Such purification facilitating domains include, but are 
not limited to, metal chelating peptides such as histidine-tryptophan 
modules that allow purification on immobilized metals, protein A domains 
that allow purification on immobilized immunoglobulin, and the domain 
utilized in the FLAGS extension/affinity purification system (Immunex 
Corp, Seattle WA) . The inclusion of a cleavable linker sequences such as 
Factor XA or enterokinase {Invitrogen, San Diego CA) between the 
purification domain and NSPLP is useful to facilitate purification. One 
such expression vector provides for expression of a fusion protein 
compromising an NSPLP and contains nucleic acid encoding 6 histidine 
residues followed by thioredoxin and an enterokinase cleavage site. The 
histidine residues facilitate purification on IMIAC (immobilized metal 
ion affinity chromotography as described in Porath et al (1992) Protein 
Expression and Purification 3: 263-281) while the enterokinase cleavage 
site provides a means for purifying NSPLP from the fusion protein. 
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In addition to recombinant production, fragments of NSPLP may be 
produced by direct peptide synthesis using solid-phase techniques (cf 
Stewart et al (1969) Sol id- Phase Peptide Synthesis . WH Freeman Co, San 
Francisco; Merrifieid J (1963) J Am Chem Soc 85:2149-2154). In vitro 
protein synthesis may be performed using manual techniques or by 
automation. Automated synthesis may be achieved, for example, using 
Applied Biosystems 431A Peptide Synthesizer {Perkin Elmer, Foster City 
CA) in accordance with the instructions provided by the manufacturer. 
Various fragments of NSPLP may be chemically synthesized separately and 
combined using chemical methods to produce the full length molecule. 
VMP of N8F&P 

The rationale for use of the nucleotide and polypeptide sequences 
disclosed herein is based in part on the chemical and structural homology 
among the novel NSPLP proteins disclosed herein, NSP-A (GI 30730*7; 
Roebroek et al, supra), NSP-B (GI 307309; Roebroek et al, supra), NSP-C 
• (GI 307311; Roebroek et al, supra), and rat CI-13 (GI 281046; Wieczorek 
et al, supra) . 

Accordingly, NSPLP or a NSPLP derivative may be used to treat 
cancer and neurodegenerative disorders, such as ALS. In those conditions 
where NSPLP protein activity is not desirable, cells could be transfected 
with antisense sequences of NSPLP-encoding polynucleotides or provided 
with antagonists of NSPLP. 

NBPLP Antibody 

NSPLP-specif ic antibodies are useful for the diagnosis of 
conditions and diseases associated with expression of NSPLP. Such 
antibodies may include, but are not limited to, polyclonal, monoclonal, 
chimeric, single chain, Fab fragments and fragments produced by a Fab 
expression library. Neutralizing antibodies, ie, those which inhibit 
dimer formation, are especially preferred for diagnostics and 
therapeutics. 

NSPLP for antibody induction does not require biological activity; 
however, the protein fragment, or oligopeptide must be antigenic. 
Peptides used to induce specific antibodies may have an amino acid 
sequence consisting of at least five amino acids, preferably at least 10 
amino acids. Preferably, they should mimic a portion of the amino acid 
sequence of the natural protein and may contain the entire amino acid 
sequence of a small, naturally occurring molecule. Short stretches of 
NSPLP amino acids may be fused with those of another protein such as 
keyhole limpet hemocyanin and antibody produced against the chimeric 
molecule. Procedures well known in the art can be used for the 
production of antibodies to NSPLP. 

For the production of antibodies, various hosts including goats, 
rabbits, rats, mice, etc may be immunized by injection with NSPLP or any 
portion, fragment or oligopeptide which retains immunogenic properties. 
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Depending on the host species, various adjuvants may be used to increase 
immunological response. Such adjuvants include but are not limited to, 
Freund's, mineral gels such as aluminum hydroxide, and surface active 
substances such as lysolecithin, pluronic polyols, polyanions, peptides, 
oil emulsions, keyhole limpet hemocyanin, and dinitrophenol . BCG 
(bacilli Calmette-Guerin) and Corynebacterium parvum are potentially 
useful human adjuvants. 

Monoclonal antibodies to NSPLP may be prepared using any technique 
which provides for the production of antibody molecules by continuous 
cell lines in culture. These include but are not limited to the 
hybridoma technique originally described by Koehler and Milstein (1975 
Nature 256:495-497), the human B-cell hybridoma technique (Kosbor et al 
(1983) Immunol Today 4:72; Cote et al (1983) Proc Natl Acad Sci 
80:2026-2030) and the EBV-hybridoma technique (Cole et al (1985) 
Monoclonal Antibodies and Cancer Therapy, Alan R Liss Inc, New York NY, 
pp 77-96). 

In addition, techniques developed for the production of "chimeric 
antibodies", the splicing of mouse antibody genes to human antibody genes 
to obtain a molecule with appropriate antigen specificity and biological 
activity can be used (Morrison et al (1984) Proc Natl Acad Sci 
81:6851-6855; Neuberger et al (1984) Nature 312:604-608; Takeda et al 
(1985) Nature 314:452-454) . Alternatively, techniques described for the 
production of single chain antibodies (US Patent No. 4,946,778) can be 
adapted to produce NSPLP-specif ic single chain antibodies 

Antibodies may also be produced by inducing ia vivo production in 
the lymphocyte population or by screening recombinant immunoglobulin 
libraries or panels of highly specific binding reagents as disclosed in 
Orlandi et al (1989, Proc Natl Acad Sci B6: 3833-3837), and Winter G and 
Milstein C (1991; Nature 349:293-299). 

Antibody fragments which contain specific binding sites for NSPLP 
may also be generated. For example, such fragments include, but are not 
limited to, the F(ab')2 fragments which can be produced by pepsin 
digestion of the antibody molecule and the Fab fragments which can be 
generated by reducing the disulfide bridges of the F(ab')2 fragments. 
Alternatively, Fab expression libraries may be constructed to allow rapid 
and easy identification of monoclonal Fab fragments with the desired 
specificity (Huse WD et al (1989) Science 256:1275-1281). 

A variety of protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established 
specificities are well known in the art. Such immunoassays typically 
involve the formation of complexes between NSPLP and its specific 
antibody and the measurement of complex formation. A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reactive to 
two noninterfering epitopes on a specific NSPLP protein is preferred, but 
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a competitive binding assay may also be employed. These assays are 
described in Maddox DE et al (1983, J Exp Med 158:1211). 

pi^ft^ig Moy' tt * 4t1 " mbpt.p ftpt^fin ftntihmiiOT 

Particular NSPLP antibodies are useful for the diagnosis of 
conditions or diseases characterized by expression of NSPLP or in assays 
to monitor patients being treated with NSPLP, agonists or inhibitors. 
Diagnostic assays for NSPLP include methods utilizing the antibody and a 
label to detect NSPLP in human body fluids or extracts of cells or 
tissues. The polypeptides and antibodies of the present invention may be 
used with or without modification. Frequently, the polypeptides and 
antibodies will be labeled by joining them, either covalently or 
noncovalently, with a reporter molecule. A wide variety of reporter 
molecules are known, several of which were described above. 

A variety of protocols for measuring NSPLP, using either polyclonal 
or monoclonal antibodies specific for the respective protein are known in 
the art. Examples include enzyme- linked immunosorbent assay (ELISA) , 
radioimmunoassay (RIA) and fluorescent activated cell sorting (FACS) . A 
two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering epitopes on NSPLP is preferred, but a 
competitive binding assay may be employed. These assays are described, 
among other places, in Maddox, DE et al (1983, J Exp Med 158:1211). 

In order to provide a basis for diagnosis, normal or standard 
values for NSPLP expression must be established. This is accomplished by 
combining body fluids or cell extracts taken from normal subjects, either 
animal or human, with antibody to NSPLP under conditions suitable for 
complex formation which are well known in the art. The amount of 
standard complex formation may be quantified by comparing various 
artificial membranes containing known quantities of NSPLP with both 
control and disease samples from biopsied tissues. Then, standard values 
obtained from normal samples may be compared with values obtained from 
samples from subjects potentially affected by disease. Deviation between 
standard and subject values establishes the presence of disease state. 

Praq Serening 

NSPLP, its catalytic or immunogenic fragments or oligopeptides 
thereof, can be used for screening therapeutic compounds in any of a 
variety of drug screening techniques. The fragment employed in such a 
test may be free in solution, affixed to a solid support, borne on a eel' 
surface, or located intracellularly. The formation of binding complexes, 
between NSPLP and the agent being tested, may be measured. 

Another technique for drug screening which may be used provides for 
high throughput screening of compounds having suitable binding affinity 
to the NSPLP is described in detail in " Determination of Amino Acid 
Sequence Antigenicity" by Geysen HN, WO Application 84/03564, published 
on September 13, 1984, and incorporated herein by reference. In summary, 
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large numbers of different small peptide test compounds are synthesized 
on a solid substrate, such as plastic pins or some other surface. The 
peptide test compounds are reacted with fragments of NSPLP and washed. 
Bound NSPLP is then detected by methods well known in the art. Purified 
NSPLP can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and 
immobilize it on a solid support. 

This invention also contemplates the use of competitive drug 
screening assays in which neutralizing antibodies capable of binding 
NSPLP specifically compete with a test compound for binding NSPLP. In 
this manner, the antibodies can be used to detect the presence of any 
peptide which shares one or more antigenic determinants with NSPLP. 
Uses of the Polynucleotide Encoding NSPLP 

A polynucleotide encoding NSPLP, or any part thereof, may be used 
for diagnostic and/or therapeutic purposes. For diagnostic purposes, 
polynucleotides encoding NSPLP of this invention may be used to detect 
and quantitate gene expression in biopsied tissues in which expression of 
NSPLP may be implicated. The diagnostic assay is useful to distinguish 
between absence, presence, and excess expression of NSPLP and to monitor 
regulation of NSPLP levels during therapeutic intervention. Included in 
the scope of the invention are oligonucleotide sequences, antisense RNA 
and DNA molecules, and PNAs. 

Another aspect of the subject invention is to provide for 
hybridization or PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding NSPLP or closely related 
molecules. The specificity of the probe, whether it is made from a 
highly specific region, eg, 10 unique nucleotides in the 5' regulatory 
region, or a less specific region, eg, especially in the 3' region, and 
the stringency of the hybridization or amplification (maximal, high, 
intermediate or low) will determine whether the probe identifies only 
naturally occurring sequences encoding NSPLP, alleles or related 
sequences . 

Probes may also be used for the detection of related sequences and 
should preferably contain at least 50% of the nucleotides from any of 
these NSPLP encoding sequences. The hybridization probes of the subject 
invention may be derived from the nucleotide sequence of SEQ ID NO: 2 or 
from genomic sequence including promoter, enhancer elements and introns 
of the naturally occurring NSPLP. Hybridization probes may be labeled by 
a variety of reporter groups, including radionuclides such as 32P or 35S, 
or enzymatic labels such as alkaline phosphatase coupled to the probe via 
avidin/biotin coupling systems, and the like. 

Other means for producing specific hybridization probes for DNAs 
encoding NSPLP include the cloning of nucleic acid sequences encoding 
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NSPLP or NSPLP derivatives into vectors for the production of roRNA 
probes. Such vectors are known in the art and are commercially available 
and may be used to synthesize RNA probes in vitro by means of the 
addition of the appropriate RNA polymerase as T7 or SP6 RNA polymerase 
and the appropriate radioactively labeled nucleotides. 

Polynucleotide sequences encoding NSPLP may be used for the 
diagnosis of conditions or diseases with which the expression of NSPLP is 
associated. For example, polynucleotide sequences encoding NSPLP may be 
used in hybridization or PCR assays of fluids or tissues from biopsies to 
detect NSPLP expression. The form of such qualitative or quantitative 
methods may include Southern or northern analysis, dot blot or other 
membrane -based technologies; PCR technologies; dip stick, pIN, chip and 
ELISA technologies. All of these techniques are well known in the art 
and are the basis of many commercially available diagnostic kits. 

The nucleotide sequences encoding NSPLP disclosed herein provide 
the basis for assays that detect activation or induction associated with 
cancer and neurodegenerative disorders, such as ALS. The nucleotide 
sequence encoding NSPLP may be labeled by methods known in the art and 
added to a fluid or tissue sample from a patient under conditions 
suitable for the formation of hybridization complexes. After an 
incubation period, the sample is washed with a compatible fluid which 
optionally contains a dye (or other label requiring a developer) if the 
nucleotide has been labeled with an enzyme. After the compatible fluid 
is rinsed off, the dye is quantitated and compared with a standard. If 
the amount of dye in the biopsied or extracted sample is significantly 
elevated over that of a comparable control sample, the nucleotide 
sequence has hybridized with nucleotide sequences in the sample, and the 
presence of elevated levels of nucleotide sequences encoding NSPLP in the 
sample indicates the presence of the associated disease. 

Such assays may also be used to evaluate the efficacy of a 
particular therapeutic treatment regime in animal studies, in clinical 
trials, or in monitoring the treatment of an individual patient. In 
order to provide a basis for the diagnosis of disease, a normal or 
standard profile for NSPLP expression must be established. This is 
accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with NSPLP, or a portion thereof, under 
conditions suitable for hybridization or amplification. Standard 
hybridization may be quantified by comparing the values obtained for 
normal subjects with a dilution series of NSPLP run in the same 
experiment where a known amount of a substantially purified NSPLP is 
used. Standard values obtained from normal samples may be compared with 
values obtained from samples from patients afflicted with 
NSPLP-associated diseases. Deviation between standard and subject values 
is used to establish the presence of disease. 
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Once disease is established, a therapeutic agent is administered 
and a treatment profile is generated. Such assays may be repeated on a 
regular basis to evaluate whether the values in the profile progress 
toward or return to the normal or standard pattern- Successive treatment 
profiles may be used to show the efficacy of treatment over a period of 
several days or several months. 

PCR, as described in US Patent Nos. 4,683,195 and 4, 965,188, 
provides additional uses for oligonucleotides based upon the NSPLP 
sequence. Such oligomers are generally chemically synthesized, but they 
may be generated enzymatically or produced from a recombinant source. 
Oligomers generally comprise two nucleotide sequences, one with sense 
orientation (S'->3') and one with antisense (3'<-5'), employed under 
optimized conditions for identification of a specific gene or condition. 
The same two oligomers, nested sets of oligomers, or even a degenerate 
pool of oligomers may be employed under less stringent conditions for 
detection and/or quantitation of closely related DNA or RNA sequences. 

Additionally, methods which may be used to quantitate the 
expression of a particular molecule include radiolabeling (Melby PC et al 
1993 J Immunol Methods 159:235-44) or biotinylating (Duplaa C et al 1993 
Anal Biochem 229-36) nucleotides, coamplif ication of a control nucleic 
acid, and standard curves onto which the experimental results are 
interpolated. Quantitation of multiple samples may be speeded up by 
running the assay in an ELISA format where the oligomer of interest is 
presented in various dilutions and a spectrophotometry or colorimetric 
response gives rapid quantitation. For example, the presence of a 
relatively high amount of NSPLP in extracts of biopsied tissues may 
indicate the onset of cancer. A definitive diagnosis of this type may 
allow health professionals to begin aggressive treatment and prevent 
further worsening of the condition. Similarly, further assays can be 
used to monitor the progress of a patient during treatment. Furthermore, 
the nucleotide sequences disclosed herein may be used in molecular 
biology techniques that have not yet been developed, provided the new 
techniques rely on properties of nucleotide sequences that are currently 
known such as the triplet genetic code, specific base pair interactions, 
and the like. 
Therapeutic Uaa 

Based upon its homology to genes encoding NSP-like proteins and its 
expression profile, polynucleotide sequences encoding NSPLP disclosed 
herein may be useful in the treatment of conditions such as cancer and 
neurodegenerative disorders, such as ALS. 

Expression vectors derived from retroviruses, adenovirus, herpes or 
vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ-, tissue or cell 
population. Methods which are well known to those skilled in the art can 
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be used to construct recombinant vectors which will express antisense 
polynucleotides of the gene encoding NSPLP. See, for example, the 
techniques described in Sambrook et al (supra) and Ausubel et al (supra) . 

The polynucleotides comprising full length cDNA sequence and/or its 
regulatory elements enable researchers to use sequences encoding NSPLP as 
an investigative tool in sense (Youssoufian H and HF Lodish 1993 Mol Cell 
Biol 13:98-104) or antisense {Eguchi et al (1991) Annu Rev Biochem 
60:631-652) regulation of gene function. Such technology is now well 
known in the art, and sense or antisense oligomers, or larger fragments, 
can be designed from various locations along the coding or control 
regions . 

Genes encoding NSPLP can be turned off by transfecting a cell or 
tissue with expression vectors which express high levels of a desired 
NSPLP-encoding fragment. Such constructs can flood cells with 
untranslatable sense or antisense sequences. Even in the absence of 
integration into the DNA, such vectors may continue to transcribe RNA 
molecules until all copies are disabled by endogenous nucleases. 
Transient expression may last for a month or more with a non-replicating 
vector (Mettler I, personal communication) and even longer if appropriate 
replication elements are part of the vector system. 

As mentioned above, modifications of gene expression can be 
obtained by designing antisense molecules, DNA, RNA or PNA, to the 
control regions of gene encoding NSPLP, ie, the promoters, enhancers, and 
introns. Oligonucleotides derived from the transcription initiation 
site, eg, between -10 and +10 regions of the leader sequence, are 
preferred. The antisense molecules may also be designed to block 
translation of mRNA by preventing the transcript from binding to 
ribosomes. Similarly, inhibition can be achieved using "triple helix" 
base-pairing methodology. Triple helix pairing compromises the ability 
of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic 
advances using triplex DNA were reviewed by Gee JE et al (In: Huber BE 
and BI Carr (1994) MolCCUlar and Immunologic Approaches. Futura 
Publishing Co, Mt Kisco NY) . 

Ribozymes are enzymatic RNA molecules capable of catalyzing the 
specific cleavage of RNA. The mechanism of ribozyme action involves 
sequence-specific hybridization of the ribozyme molecule to complementary 
target RNA, followed by endonucleolytic cleavage. Within the scope of 
the invention are engineered hammerhead motif ribozyme molecules that can 
specifically and efficiently catalyze endonucleolytic cleavage of 
sequences encoding NSPLP. 

Specific ribozyme cleavage sites within any potential RNA target 
are initially identified by scanning the target molecule for ribozyme 
cleavage sites which include the following sequences, GUA, GUO and GUC. 
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Once identified, short RNA sequences of between 15 and 20 ribonucleotides 
corresponding to the region of the target gene containing the cleavage 
site may be evaluated for secondary structural features which may render 
the oligonucleotide inoperable. The suitability of candidate targets may 
5 also be evaluated by testing accessibility to hybridization with 

complementary oligonucleotides using ribonuclease protection assays. 

Antisense molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of RNA molecules. These 
include techniques for chemically synthesizing oligonucleotides such as 

10 solid phase phosphoramidite chemical synthesis. Alternatively, RNA 

molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding NSPLP. Such DNA sequences may be incorporated into a 
wide variety of vectors with suitable RNA polymerase promoters such as T7 
or SP6. Alternatively, antisense cDNA constructs that synthesize 

15 antisense RNA constitutively or inducibly can be introduced into cell 

lines, cells or tissues. 

RNA molecules may be modified to increase intracellular stability 
and half-life. Possible modifications include, but are not limited to, 
the addition of flanking sequences at the 5' and/or 3' ends of the 

20 molecule or the use of phosphorothioate or 2' O-methyl rather than 

phosphodiesterase linkages within the backbone of the molecule. This 
concept is inherent in the production of PNAs and can be extended in all 
of these molecules by the inclusion of nontraditional bases such as 
inosine, queosine and wybutosine as well as acetyl-, methyl-, thio- and 

2S similarly modified forms of adenine, cytidine, guanine, thymine, and 

uridine which are not as easily recognized by endogenous endonucleases . 

Methods for introducing vectors into cells or tissues include those 
methods discussed infra and which are equally suitable for in vivo , in 
Yitro and vivo therapy. For viyo therapy, vectors are introduced 

30 into stem cells taken from the patient and clonally propagated for 

autologous transplant back into that same patient is presented in US 
Patent Nos. 5,399,4 93 and 5,4 37,994, disclosed herein by reference. 
Delivery by transfection and by liposome are quite well known in the art. 
Furthermore, the nucleotide sequences for NSPLP disclosed herein 

35 may be used in molecular biology techniques that have not yet been 

developed, provided the new techniques rely on properties of nucleotide 
sequences that are currently known, including but not limited to such 
properties as the triplet genetic code and specific base pair 
interactions. 

40 Detection and Mapping of Related Polvnuelnotid a Saoueneaa 

The nucleic acid sequence for NSPLP can also be used to generate 
hybridization probes for mapping the naturally occurring genomic 
sequence. The sequence may be mapped to a particular chromosome or to a 
specific region of the chromosome using well known techniques. These 
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include in, situ hybridization to chromosomal spreads, flow-sorted 
chromosomal preparations, or artificial chromosome constructions such as 
yeast artificial chromosomes, bacterial artificial chromosomes, bacterial 
PI constructions or single chromosome cDNA libraries as reviewed in Price 
CM (1993; Blood Rev 7:127-34) and Trask BJ (1991; Trends Genet 7:149-54). 

The technique of fluorescent in situ hybridization of chromosome 
spreads has been described, among other places, in Verma et al (1988) 
Human Chromosomes: & Manual OL £a£l£ Techniques. Pergamon Press, New 
York NY. Fluorescent in situ hybridization of chromosomal preparations 
and other physical chromosome mapping techniques may be correlated with 
additional genetic map data. Examples of genetic map data can be found 
in the 1994 Genome Issue of Science (265:1981f). Correlation between the 
location of the gene encoding NSPLP on a physical chromosomal map and a 
specific disease (or predisposition to a specific disease) may help 
delimit the region of DNA associated with that genetic disease. The 
nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected 
individuals. 

In situ hybridization of chromosomal preparations and physical 
mapping techniques such as linkage analysis using established chromosomal 
markers may be used for extending genetic maps. For example an sequence 
tagged site based map of the human genome was recently published by the 
Whitehead-MIT Center for Genomic Research (Hudson TJ et al(1995) Science 
270:1945-1954). Often the placement of a gene on the chromosome of 
another mammalian species such as mouse (Whitehead Institute/MIT Center 
for Genome Research, Genetic Map of the Mouse, Database Release 10, April 
28, 1995) may reveal associated markers even if the number or arm of a 
particular human chromosome is not known. New sequences can be assigned 
to chromosomal arms, or parts thereof, by physical mapping. This 
provides valuable information to investigators searching for disease 
genes using positional cloning or other gene discovery techniques. Once 
a disease or syndrome, such as ataxia telangiectasia (AT), has been 
crudely localized by genetic linkage to a particular genomic region, for 
example, AT to llq22-23 (Gatti et al (1988) Nature 336:577-580), any 
sequences mapping to that area may represent associated or regulatory 
genes for further investigation. The nucleotide sequence of the subject 
invention may also be used to detect differences in the chromosomal 
location due to translocation, inversion, etc. among normal, carrier or 
affected individuals. 
Pharmaceutical Compositiona 

The present invention relates to pharmaceutical compositions which 
may comprise nucleotides, proteins, antibodies, agonists, antagonists, or 
inhibitors, alone or in combination with at least one other agent, such 
as stabilizing compound, which may be administered in any sterile, 
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biocompatible pharmaceutical carrier, including, but not limited to, 
saline, buffered saline, dextrose, and water. Any of these molecules can 
be administered to a patient alone, or in combination with other agents, 
drugs or hormones, in pharmaceutical compositions where it is mixed with 
excipient(s) or pharmaceutical ly acceptable carriers. In one embodiment 
of the present invention, the pharmaceutically acceptable carrier is 
pharmaceutically inert. 

Administration of pharmaceutical compositions is accomplished 
orally or parenterally . Methods of parenteral delivery include topical, 
intra-arterial (directly to the tumor), intramuscular, subcutaneous, 
intramedullary, intrathecal, intraventricular, intravenous, 
intraperitoneal, or intranasal administration. In addition to the active 
ingredients, these pharmaceutical compositions may contain suitable 
pharmaceutically acceptable carriers comprising excipients and 
auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutically. Further details on 
techniques for formulation and administration may be found in the latest 
edition of "Remington's Pharmaceutical Sciences" (Maack Publishing Co, 
Easton PA) . 

Pharmaceutical compositions for oral administration can be 
formulated using pharmaceutically acceptable carriers well known in the 
art in dosages suitable for oral administration. Such carriers enable 
the pharmaceutical compositions to be formulated as tablets, pills, 
dragees, capsules, liquids, gels, syrups, slurries, suspensions and the 
like, for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through 
combination of active compounds with solid excipient, optionally grinding 
a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or dragee cores. 
Suitable excipients are carbohydrate or protein fillers such as sugars, 
including lactose, sucrose, mannitol, or sorbitol; starch from corn, 
wheat, rice, potato, or other plants; cellulose such as methyl cellulose, 
hydroxypropylmethyl-cellulose, or sodium carboxymethyl cellulose; and gums 
including arabic and tragacanth; and proteins such as gelatin and 
collagen. If desired, disintegrating or solubilizing agents may be 
added, such as the cross -linked polyvinyl pyrrol idone, agar, alginic 
acid, or a salt thereof, such as sodium alginate. 

Dragee cores are provided with suitable coatings such as 
concentrated sugar solutions, which may also contain gum arabic, talc, 
polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium 
dioxide, lacquer solutions, and suitable organic solvents or solvent 
mixtures. Dyestuffs or pigments may be added to the tablets or dragee 
coatings for product identification or to characterize the quantity of 
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active compound, ie, dosage. 

Pharmaceutical preparations which can be used orally include 
push-fit capsules made of gelatin, as well as soft, sealed capsules made 
of gelatin and a coating such as glycerol or sorbitol. Push-fit capsules 
can contain active ingredients mixed with a filler or binders such as 
lactose or starches, lubricants such as talc or magnesium stearate, and, 
optionally, stabilizers. In soft capsules, the active compounds may be 
dissolved or suspended in suitable liquids, such as fatty oils, liquid 
paraffin, or liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations for parenteral administration include 
aqueous solutions of active compounds. For injection, the pharmaceutical 
compositions of the invention may be formulated in aqueous solutions, 
preferably in physiologically compatible buffers such as Hanks 1 s 
solution, Ringer's solution, or physiologically buffered saline. Aqueous 
injection suspensions may contain substances which increase the viscosity 
of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Additionally, suspensions of the active compounds may be 
prepared as appropriate oily injection suspensions. Suitable lipophilic 
solvents or vehicles include fatty oils such as sesame oil, or synthetic 
fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. 
Optionally, the suspension may also contain suitable stabilizers or 
agents which increase the solubility of the compounds to allow for the 
preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the 
particular barrier to be permeated are used in the formulation. Such 
penetrants are generally known in the art. 
Mamifaetaira and fitoraao 

The pharmaceutical compositions of the present invention may be 
manufactured in a manner that known in the art, eg, by means of 
conventional mixing, dissolving, granulating, dragee -ma king, levigating, 
emulsifying, encapsulating, entrapping or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be 
formed with many acids, including but not limited to hydrochloric, 
sulfuric, acetic, lactic, tartaric, malic, succinic, etc. Salts tend to 
be more soluble in aqueous or other protonic solvents that are the 
corresponding free base forms. In other cases, the preferred preparation 
may be a lyophiiized powder in lmM-50 mM histidine, G.1%-2% sucrose, 
2%-7% mannitol at a pH range of 4.5 to 5.5 that is combined with buffer 
prior to use. 

After pharmaceutical compositions comprising a compound of the 
invention formulated in a acceptable carrier have been prepared, they can 
be placed in an appropriate container and labeled for treatment of an 
indicated condition. For administration of NSPLP, such labeling would 
include amount, frequency and method of administration. 
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Therapeutically Eff active Pggg 

Pharmaceutical compositions suitable for use in the present 
invention include compositions wherein the active ingredients are 
contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those 
skilled in the art. 

For any compound, the therapeutically effective dose can be 
estimated initially either in cell culture assays, eg, of neoplastic 
cells, or in animal models, usually mice, rabbits, dogs, or pigs. The 
animal model is also used to achieve a desirable concentration range and 
route of administration. Such information can then be used to determine 
useful doses and routes for administration in humans. 

A therapeutically effective dose refers to that amount of protein 
or its antibodies, antagonists, or inhibitors which ameliorate the 
15 symptoms or condition. Therapeutic efficacy and toxicity of such 

compounds can be determined by standard pharmaceutical procedures in cell 
cultures or experimental animals, eg, ED50 (the dose therapeutically 
effective in 50% of the population) and LD50 (the dose lethal to 50% of 
the population) . The dose ratio between therapeutic and toxic effects is 
20 the therapeutic index, and it can be expressed as the ratio, LD50/ED50. 

Pharmaceutical compositions which exhibit large therapeutic indices are 
preferred. The data obtained from cell culture assays and animal studies 
is used in formulating a range of dosage for human use. The dosage of 
such compounds lies preferably within a range of circulating 
25 concentrations that include the ED50 with little or no toxicity. The 

dosage varies within this range depending upon the dosage form employed, 
sensitivity of the patient, and the route of administration. 

The exact dosage is chosen by the individual physician in view of 
the patient to be treated. Dosage and administration are adjusted to 
30 provide sufficient levels of the active moiety or to maintain the desired 

effect. Additional factors which may be taken into account include the 
severity of the disease state, eg, tumor size and location; age, weight 
and gender of the patient; diet, time and frequency of administration, 
drug combination (s) , reaction sensitivities, and tolerance/response to 
35 therapy. Long acting pharmaceutical compositions might be administered 

every 3 to 4 days, every week, or once every two weeks depending on 
half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from 0.1 to 100,000 micrograms, up 
to a total dose of about 1 g, depending upon the route of administration. 
Guidance as to particular dosages and methods of delivery is provided in 
the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for 
nucleotides than for proteins or their inhibitors. Similarly, delivery 
of polynucleotides or polypeptides will be specific to particular cells. 
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conditions, locations, etc. 

It is contemplated, for example, that NSPLP or an NSPLP derivative 
can be delivered in a suitable formulation to block the progression of 
cancerous cell growth or of neuronal degeneration. Similarly, 
administration of NSPLP antagonists may also inhibit the activity or 
shorten the lifespan of this protein. 

The examples below are provided to illustrate the subject invention 
and are not included for the purpose of limiting the invention. 

INDUSTRIAL APPLICABILITY 
I Construction of cDNA Lihrariaa 
THF-3, 

THP-1 is a human leukemic cell line derived from the blood of a 1- 
year-old boy with acute monocytic leukemia. The THP-1 cells represent 
monocytes. The THP-1 cDNA library was custom constructed by Stratagene 
(Stratagene, 11099 M. Torrey Pines Rd., La Jolla, CA 92037) essentially 
as described below. 

Stratagene prepared the cDNA library using oligo d(T) priming. 
Synthetic adapter oligonucleotides were ligated onto the cDNA molecules 
enabling them to be inserted into the Uni-ZAP™ vector system 
(Stratagene) . This allowed high efficiency unidirectional (sense 
orientation) lambda library construction and the convenience of a plasmid 
system with blue/white color selection to detect clones with cDKA 
insertions. 

The quality of the cDNA library was screened using DNA probes, and 
then, the pBluescript® phagemid (Stratagene) was excised. This ph a gem id 
allows the use of a plasmid system for easy insert characterization, 
sequencing, site-directed mutagenesis, the creation of unidirectional 
deletions and expression of fusion polypeptides. Subsequently, the 
custom-constructed library phage particles were infected into E. coli 
host strain XLl-Blue® (Stratagene) . The high transformation efficiency 
of this bacterial strain increases the probability that the cDNA library 
will contain rare, under-represented clones. Alternative unidirectional 
vectors include, but are not limited to, pcDNAI (Invitrogen, San Diego 
CA) and pSHlox-1 (Novagen, Madison WI) . 

The human spleen cell cDNA library was custom constructed by 
Stratagene (catalogue # 937205. Stratagene, La Jolla CA) . The starting 
cell population is mixed, having been obtained from fetal spleens which 
have a diverse cell population. Furthermore, the fetal spleens have been 
pooled from different sources. Poly(A+) RNA (mRNA) was purified from the 
spleen cells. cDNA was synthesized from the mRNA. Synthetic adaptor 
oligonucleotides were ligated onto cDNA ends enabling its insertion into 
Uni-ZAP* vector system (Stratagene), allowing high efficiency 
unidirectional (sense orientation) lambda library construction and the 
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convenience of a plasmid system with blue/white color selection to detect 
clones with cDNA insertions. Alternative unidirectional vectors are 
pcDNAl (invitrogen, San Diego CAl and pSHlox-1 (Novagen, Madison WI) . 

xi Isolation of cPMA Clgn%s 

THP-1 

The phagemid forms of individual cDNA clones were obtained by the 
in vivo excision process, in which the host bacterial strain was co- 
infected with both the library phage and an fl helper phage. 
Polypeptides or enzymes derived from both the library-containing phage 
and the helper phage nicked the DNA, initiated new DNA synthesis from 
defined sequences on the target DNA/ and created a smaller, single 
stranded circular phagemid DNA molecule that included all DNA sequences 
of the pBluescript phagemid and the cDNA insert. The phagemid DNA was 
released from the cells and purified, and used to reinfect fresh host 
cells (SOLR, Stratagene) where double -stranded phagemid DNA was produced. 
Because the phagemid carries the gene for {*- lactamase, the newly 
transformed bacteria were selected on medium containing ampicillin. 

An alternate method of purifying phagemid has recently become 
available. It utilizes the Miniprep Kit (Catalog No. 77468, available 
from Advanced Genetic Technologies Corp., 19212 Orbit Drive, 
Gaithersburg, Maryland) . This kit is in the 96-well format and provides 
enough reagents for 960 purifications. Each kit is provided with a 
recommended protocol, which has been employed except for the following 
changes. First, the 96 wells are each filled with only 1 ml of sterile 
terrific broth with carbenicillin at 25 mg/L and glycerol at 0.4%. After 
the wells are inoculated, the bacteria are cultured for 24 hours and 
lysed with 60 ^1 of lysis buffer. A centrif ugation step (2900 rpm for 5 
minutes) is performed before the contents of the block are added to the 
primary filter plate. The optional step of adding isopropanol to TRIS 
buffer is not routinely performed. After the last step in the protocol, 
samples are transferred to a Beckman 96-well block for storage. 

Phagemid DNA was also purified using the QIAWELL-8 Plasmid 
Purification System from the QIAGEN® DNA Purification System (01 AG EN Inc, 
Chatsworth CA) . This product provides a convenient, rapid and reliable 
high-throughput method for lysing the bacterial cells and isolating 
highly purified phagemid DNA using QIAGEN anion-exchange resin particles 
with EMPORE™ membrane technology from 3M in a multiwell format. The DNA 
was eluted from the purification resin and prepared for DNA sequencing 
and other analytical manipulations. 

The phagemid forms of individual cDNA clones were obtained by the 
in vivo excision process, in which the host bacterial strain was co- 
infected with both the library phage and an fl helper phage. 
Polypeptides or enzymes derived from both the library-containing phage 
and the helper phage nicked the DNA, initiated new DNA synthesis from 
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defined sequences on the target DNA, and created a smaller, single 
stranded circular phagemid DNA molecule that included all DNA sequences 
of the pBluescript phagemid and the cDNA insert. The phagemid DNA was 
released from the cells and purified, and used to reinfect fresh host 
cells (SOLR, Stratagene) where double-stranded phagemid DNA was produced. 
Because the phagemid carries the gene for fi-lactamase, the newly 
transformed bacteria were selected on medium containing ampicillin. 

Phagemid DNA was also purified using the QIAWELL-8 Plasmid 
Purification System from the QIAGEN® DNA Purification System (QIAGEN Inc, 
Chatsworth CA) . This product provides a convenient, rapid and reliable 
high-throughput method for lysing the bacterial cells and isolating 
highly purified phagemid DNA using QIAGEN anion-exchange resin particles 
with EMPORE™ membrane technology from 3M in a raultiwell format. The DNA 
was eluted from the purification resin and prepared for DNA sequencing 
and other analytical manipulations. 

Ill Homology S«archina of cDNA ClonM ind Thnir Dodacad Protaina 

Each cDNA was compared to sequences in GenBank using a search 
algorithm developed by Applied Biosystems and incorporated into the 
INHERIT' 670 Sequence Analysis System. In this algorithm, Pattern 
Specification Language (TRW Inc, Los Angeles CA) was used to determine 
regions of homology. The three parameters that determine how the 
sequence comparisons run were window size, window offset, and error 
tolerance. Using a combination of these three parameters, the DNA 
database was searched for sequences containing regions of homology to the 
query sequence, and the appropriate sequences were scored with an initial 
value . Subsequently, these homologous regions were examined using dot 
matrix homology plots to distinguish regions of homology from chance 
matches. Smith -Waterman alignments were used to display the results of 
the homology search. 

Peptide and protein sequence homologies were ascertained using the 
INHERIT- 670 Sequence Analysis System in a way similar to that used in 
DNA sequence homologies- Pattern Specification Language and parameter 
windows were used to search protein databases for sequences containing 
regions of homology which were scored with an initial value. Dot -matrix 
homology plots were examined to distinguish regions of significant 
homology from chance matches. 

BLAST, which stands for Basic Local Alignment Search Tool (Altschul 
SF (1993) J Mol Evol 36:290-300; Altschul, . SF et al (1990) J Mol Biol 
215:403-10), was used to search for local sequence alignments. BLAST 
produces alignments of both nucleotide and amino acid sequences to 
determine sequence similarity. Because of the local nature of the 
alignments, BLAST is especially useful in determining exact matches or in 
identifying homologs. BLAST is useful for matches which do not contain 
gaps. The fundamental unit of BLAST algorithm output is the High-scoring 
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Segment Pair (HSP) . 

An HSP consists of two sequence fragments of arbitrary but equal 
lengths whose alignment is locally maximal and for which the alignment 
score meets or exceeds a threshold or cutoff score set by the user. The 
BLAST approach is to look for HSPs between a query sequence and a 
database sequence, to evaluate the statistical significance of any 
matches found, and to report only those matches which satisfy the 
user-selected threshold of significance. The parameter E establishes the 
statistically significant threshold for reporting database sequence 
matches. E is interpreted as the upper bound of the expected frequency 
of chance occurrence of an HSP (or set of HSPs) within the context of the 
entire database search. Any database sequence whose match satisfies E is 
reported in the program output. 

IV Northern Analysis 

Northern analysis is a laboratory technique used to detect the 
presence of a transcript of a gene and involves the hybridization of a 
labelled nucleotide sequence to a membrane on which RNAs from a 
particular cell type or tissue have been bound (Sambrook et al. supra). 

Analogous computer techniques using BLAST (Altschul SF 1993 and 
1990, supra) are used to search for identical or related molecules in 
nucleotide databases such as GenBank or the LIEESEQ m database (Incyte, 
Palo Alto CA) . This analysis is much faster than multiple, membrane- 
based hybridizations- In addition, the sensitivity of the computer 
search can be modified to determine whether any particular match is 
categorized as exact or homologous. 

The basis of the search is the product score which is defined as: 
% sequence identity x % maximum BLAST score* 
100 

and it takes into acccount both the degree of similarity between two 
sequences and the length of the sequence match. For example, with a 
product score of 40, the match will be exact within a 1-2% error; and at 
70, the match will be exact. Homologous molecules are usually identified 
by selecting those which show product scores between 15 and 40, although 
lower scores may identify related molecules. 

V Extension of NSPLP-Encodincr Polvmiclaft tldtoa to Full Lancrfch or to 
Recover Regulatory Elomonf 

Full length NSPLP-encoding nucleic acid sequence (SEQ ID NO: 2) is 
used to design oligonucleotide primers for extending a partial nucleotide 
sequence to full length or for obtaining 5' sequences from genomic 
libraries. One primer is synthesized to initiate extension in the 
antisense direction (XLR) and the other is synthesized to extend sequence 
in the sense direction (XLF) . Primers allow the extension of the known 
NSPLP-encoding sequence "outward" generating amplicons containing new, 
unknown nucleotide sequence for the region of interest {US Patent 
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Application 08/48*7, 112, filed June 7, 1995, specifically incorporated by 
reference) . The initial primers are designed from the cDNA using OLIGO* 
4.06 Primer Analysis Software (National Biosciences), or another 
appropriate program, to be 22-30 nucleotides in length, to have a GC 
content of 50% or more, and to anneal to the target sequence at 
temperatures about 6B°-72° C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations is avoided. 

The original, selected cDNA libraries, or a human genomic library 
are used to extend the sequence; the latter is most useful to obtain 5* 
upstream regions. If more extension is necessary or desired, additional 
sets of primers are designed to further extend the known region. 

By following the instructions for the XL-PCR kit (Perkin Elmer) and 
thoroughly mixing the enzyme and reaction mix, high fidelity 
amplification is obtained. Beginning with 4 0 pmol of each primer and the 
recommended concentrations of all other components of the kit, PCR is 
performed using the Peltier Thermal Cycler (PTC200; MJ Research, 
Watertown MA) and the following parameters: 



Step 1 94° C for 1 min (initial denaturation) 

Step 2 65° C for 1 min 

Step 3 68° C for 6 min 

Step 4 94° C for 15 sec 

Step 5 65" C for 1 min 

Step 6 6B° C for 7 min 

Step 7 Repeat step 4-6 for 15 additional cycles 

Step 8 94° C for 15 sec 

Step 9 65° C for 1 min 

Step 10 68° C for 7:15 min 

Step 11 Repeat step 8-10 for 12 cycles 

Step 12 72° C for 8 min 

Step 13 4° C (and holding) 



A 5-10 m! aliquot of the reaction mixture is analyzed by 
electrophoresis on a low concentration (about 0.6-0.8%) agarose mini-gel 
to determine which reactions were successful in extending the sequence. 
Bands thought to contain the largest products were selected and cut " out 
of the gel. Further purification involves using a commercial gel 
extraction method such as QIA^icR™ (QIAGEN Inc) . After recovery of the 
DNA, XI enow enzyme was used to trim single-stranded, nucleotide overhangs 
creating blunt ends which facilitate religation and cloning. 

After ethanol precipitation, the products are redissolved in 13 fxl 
of ligation buffer, 1/il T4-DNA ligase (15 units) and 1^1 T4 
polynucleotide kinase are added, and the mixture is incubated at room 
temperature for 2-3 hours or overnight at 16° C. Competent ELl. £Q±L cells 
(in 40 m! of appropriate media) are transformed with 3 m1 of ligation 
mixture and cultured in 80 pi of SOC medium (Sambrook J et al, supra) . 
After incubation for one hour at 37° C, the whole transformation mixture 
is plated on Luria Bertani (LB) -agar (Sambrook J et al, supra) containing 
2xCarb. The following day, several colonies are randomly picked from 
each plate and cultured in 150 m1 of liquid LB/2xCarb medium placed in an 
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individual well of an appropriate, commercially-available, sterile 96- 
well microtiter plate. The following day, 5 nl of each overnight culture 
is transferred into a non-sterile 96-well plate and after dilution 1:10 
with water, 5 pi of each sample is transferred into a PCR array. 

For PCR amplification, 18 pi of concentrated PCR reaction mix 
(3.3x) containing 4 units of rTth DNA polymerase, a vector primer and one 
or both of the gene specific primers used for the extension reaction are 
added to each well. Amplification is performed using the following 
conditions: 



Step 1 94° C for 60 sec 

Step 2 94 Q C for 20 sec 

Step 3 55° C for 30 sec 

Step 4 72° C for 90 sec 

Step 5 Repeat steps 2-4 for an additional 29 cycles 

Step 6 72° C for 180 sec 

Step 7 4° C (and holding) 



Aliquots of the PCR reactions are run on agarose gels together 
with molecular weight markers. The sizes of the PCR products are 
compared to the original partial cDNAs, and appropriate clones are 
selected, ligated into plasmid and sequenced. 
VI Tjibalina and Uao of Hybridization Proboa 

Hybridization probes derived from SEQ ID NO: 2 are employed to 
screen cDNAs, genomic DNAs or mRNAs. Although the labeling of 
oligonucleotides, consisting of about 20 base-pairs, is specifically 
described, essentially the same procedure is used with larger cDNA 
fragments. Oligonucleotides are designed using state-of-the-art software 
such as OLIGO 4.06 (National Biosciences), labeled by combining 50 pmol 
of each oligomer and 250 mCi of [v- 32 P] adenosine triphosphate (Amersham, 
Chicago IL) and T4 polynucleotide kinase (DuPont NEN*, Boston MA) . The 
labeled oligonucleotides are substantially purified with Sephadex G-25 
super fine resin column (Pharmacia) . A portion containing 10* counts per 
minute of each of the sense and antisense oligonucleotides is used in a 
typical membrane based hybridization analysis of human genomic DNA 
digested with one of the following endonucleases (Ase I, Bgl II, Eco RI, 
Pst I, Xba 1, or Pvu II; DuPont NEN 9 ). 

The DNA from each digest is fractionated on a 0.7 percent agarose 
gel and transferred to nylon membranes (Nytran Plus, Schleicher & 
Schuell, Durham NH) . Hybridization is carried out for 16 hours at 40°C. 
To remove nonspecific signals, blots are sequentially washed at room 
temperature under increasingly stringent conditions up to 0.1 x saline 
sodium citrate and 0.5% sodium dodecyl sulfate. After XOMAT AR" 1 film 
(Kodak, Rochester NY) is exposed to the blots in a Phosphoimager cassette 
(Molecular Dynamics, Sunnyvale CA) for several hours, hybridization 
patterns are compared visually. 

vii AntiattMg MolftSttlflg 
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The NSPLP-encoding sequence, or any part thereof, is used to 
inhibit in vivo or in vitro expression of naturally occurring NSPLP. 
Although use of antisense oligonucleotides, comprising about 20 base- 
pairs, is specifically described, essentially the same procedure is used 
with larger cDNA fragments. An oligonucleotide based on the coding 
sequences of NSPLP, as shown in Figs. 1A, IB, 2A # and 2B is used to 
inhibit expression of naturally occurrinq NSPLP. The complementary 
oligonucleotide is designed from the most unique 5* sequence as shown in 
Figures 1A, IB, 2A, and 2B and used either to inhibit transcription by 
preventing promoter binding to the upstream nontranslated sequence or 
translation of an NSPLP-encoding transcript by preventing the ribosome 
from binding. Using an appropriate portion of the leader and 5' sequence 
of SEQ ID NO:2, an effective antisense oligonucleotide includes any 15-20 
nucleotides spanning the region which translates into the signal or early 
coding sequence of the polypeptide as shown in Figures 1A, IB, 2A, and 
2B. 

VIII BrarM^on of NSPLP 

Expression of the NSPLP is accomplished by subcloning the cDNAs 
into appropriate vectors and transfecting the vectors into host cells. 
In this case, the cloning vector, pSport, previously used for the 
generation of the cDNA library is used to express NSPLP in £. coli . 
Upstream of the cloning site, this vector contains a promoter for 
fi-galactosidase, followed by sequence containing the amino-terminal Met 
and the subsequent 7 residues of fi-galactosidase. Immediately following 
these eight residues is a bacteriophage promoter useful for transcription 
and a linker containing a number of unique restriction sites. 

Induction of an isolated, transfected bacterial strain with IPTG 
using standard methods produces a fusion protein which consists of the 
first seven residues of B-galactosidase, about 5 to 15 residues of 
linker, and the full length NSPLP-encoding sequence. The signal sequence 
directs the secretion of NSPLP into the bacterial growth media which can 
be used directly in the following assay for activity. 

ix HSFIiP Activity 

NSPLP' s ER targeting activity can be assessed by a method of van 
de Velde et al (1994, supra) . Microsomes are collected from cells 
expressing NSPLP by a 100,000 g spin in a method described by Verboomen H 
et al (1992 Biochem J 286:591-596). After treatment with 0.5 M KC1 and 
centrif ugation the pellet is resuspended and subject to gel 
electrophoresis. Western blot analysis using antibodies to NSPLP reveals 
the presence of NSPLP in the ER membrane. 



X Production of NSPLP Specific Antibodies 



34 

SUBSTITUTE SHEET (RULE 26) 



WO 98/06841 PCT/US97/13469 

NSPLP substantially purified using PAGE electrophoresis (Sambrook, 
supra > is used to immunize rabbits arid to produce antibodies using 
standard protocols. The amino acid sequence translated from NSPLP is 
analyzed using DNAStar software (DNAStar Inc) to determine regions of 
high immunogenicity and a corresponding oligopolypeptide is synthesized 
and used to raise antibodies by means known to those of skill in the art. 
Analysis to select appropriate epitopes, such as those near the 
C-terminus or in hydrophilic regions (shown in Figs. 7 and 8) is 
described by Ausubel FM et al (supra) . 

Typically, the oligopeptides are 15 residues in length, 
synthesized using an Applied Biosystems Peptide Synthesizer Model 431A 
using fmoc-chemistry, and coupled to keyhole limpet hemocyanin (KLH, 
Sigma) by reaction with M-maleimidobenzoyl-N-hydroxysuccinimide ester 
(HBS; Ausubel FM et al, supra) . Rabbits are immunized with the 
oligopeptide-KLH complex in complete Freund's adjuvant. The resulting 
antisera are tested for antipeptide activity, for example, by binding the 
peptide to plastic, blocking with 1% BSA, reacting with rabbit antisera, 
washing, and reacting with radioiodinated, goat anti-rabbit IgG. 

XI Purification of Naturally QcctirrinQ NSPLP Using Specific 

AntifeodiM 

Naturally occurring or recombinant NSPLP is substantially purified 
by immunoaf f inity chromatography using antibodies specific for NSPLP. An 
immunoaf f inity column is constructed by covalently coupling NSPLP 
antibody to an activated chromatographic resin such as CnBr-activated 
Sepharose (Pharmacia Biotech). After the coupling, the resin is blocked 
and washed according to the manufacturer's instructions. 

Media containing NSPLP is passed over the immunoaf f inity column, 
and the column is washed under conditions that allow the preferential 
absorbance of NSPLP (eg, high ionic strength buffers in the presence of 
detergent) . The column is eluted under conditions that disrupt 
antibody/NSPLP binding (eg, a buffer of pH 2-3 or a high concentration of 
a chaotrope such as urea or thiocyanate ion), and NSPLP is collected. 

XII Identification of Mq1«gu1q8 Milch TntftmfTt. with NSPLP 

NSPLP, or biologically active fragments thereof, are labelled with 
l "l Bolton-Hunter reagent (Bolton, AE and Hunter, WM (1973) Biochem J 133: 
529) . Candidate molecules previously arrayed in the wells of a 96 well 
plate are incubated with the labelled NSPLP, washed and any wells with 
labelled NSPLP complex are assayed. Data obtained using different 
concentrations of NSPLP are used to calculate values for the number, 
affinity, and association of NSPLP with the candidate molecules. 

All publications and patents mentioned in the above specification 
are herein incorporated by reference. Various modifications and 
variations of the described method and system of the invention will be 
apparent to those skilled in the art without departing from the scope and 

35 



SUBSTITUTE SHEET (RULE 26) 



WO 98/06841 PCT/US97/13469 

spirit of the invention. Although the invention has been described in 
connection with specific preferred embodiments, it should be understood 
that the invention as claimed should not be unduly limited to such 
specific embodiments- Indeed, various modifications of the described 
5 modes for carrying out the invention which are obvious to those skilled 

in molecular biology or related fields are intended to be within the 
scope of the following claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION 

(i) APPLICANT: INCYTE PHARMACEUTICALS , INC. 

(ii) TITLE OF THE INVENTION: TWO NOVEL HUMAN NSP-LIKE PROTEINS 

(iii) NUMBER OF SEQUENCES: 9 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 
<B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: U.S. 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) PCT APPLICATION NUMBER: To Be Assigned 

(B) FILING DATE: Filed Herewith 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/700,607 

(B) FILING DATE: AUGUST 12, 1996 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE /DOCKET NUMBER: PF-0114 PCT 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE: Consensus 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



Met Asp 


Gly 


Gin 


Lys 


Lys 


Asn 


Trp 


Lys 


Asp 


Lys 


Val 


Val 


Asp 


Leu 


Leu 


1 






5 










10 










15 




Tyr Trp Arg 


Asp 


He 


Lys 


Lys 


Thr Gly Val Val 


Phe 


Gly Ala Ser Leu 






20 










25 










30 






Phe Leu 


Leu 
35 


Leu 


Ser 


Leu 


Thr 


Val 
40 


Phe 


Ser 


He 


Val 


Ser 
45 


Val 


Thr 


Ala 


Tyr lie 


Ala 


Leu 


Ala 


Leu 


Leu 


Ser 


Val 


Thr 


He 


Ser 


Phe 


Arg 


He 


Tyr 


50 










55 










60 










Lys Gly Val 


He 


Gin 


Ala 


He 


Gin 


Lys 


Ser 


Asp 


Glu 


Gly 


His 


Pro 


Phe 


65 








70 










75 










80 


Arg Ala 


Tyr 


Leu 


Glu 


Ser 


Glu 


Val 


Ala 


He 


Ser 


Glu 


Glu 


Leu 


Val 


Gin 




85 










90 










95 




Lys Tyr 


Ser 


Asn 


Ser 


Ala 


Leu Gly 


His Val Asn Cys 


Thr 


He 


Lys 


Glu 




100 










105 










110 






Leu Arg 


Arg 
115 


Leu 


Phe 


Leu 


Val 


Asp 
120 


Asp 


Leu 


Val 


Asp 


Ser 
125 


Leu 


Lys 


Phe 


Ala Val 


Leu 


Met 


Trp 


Val 


Phe 


Thr 


Tyr 


Val 


Gly Ala 


Leu 


Phe Asn Gly 


130 








135 










140 










Leu Thr 


Leu 


Leu 


He 


Leu 


Ala 


Leu 


He 


Ser 


Leu 


Phe 


Ser 


Val 


Pro 


Val 


145 








150 










155 










160 


lie Tyr 


Glu 


Arg 


His 


Gin 


Ala 


Gin 


He 


Asp 


His 


Tyr 


Leu Gly 


Leu 


Ala 






165 










no 










175 




Asn Lys 


Asn 


Val 


Lys Asp Ala Met Ala 


Lys 


He 


Gin 


Ala 


Lys 


He 


Pro 






180 










185 










190 






Gly Leu 


Lys 
195 


Arg 


Lys 


Ala 


Glu 





















(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: 

(B) CLONE: Consensus 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

GGTTTGTGCA GTTACAGCTT TTCTNTTGGT ATGCATAATT AATANTTGGA GCTGCAAAGA 60 
GATCGTGACA AGAGATGGAC GGTCAGAAGA AAAATTGGAA GGACAAGGTT GTTGACCTCC 120 
TGTACTGGAG AGACATTAAG AAGACTGGAG TGGTGTTTGG TGCCAGCCTA TTCCTGCTGC 180 



38 



SUBSTITUTE SHEET (RULE 26) 



WO 98/06841 



PCT/US97/13469 



TTTCATTGAC AGTATTCAGC ATTGTGAGCG TAACAGCCTA CATTGCCTTG GCCCTGCTCT 240 

CTGTGACCAT CAGCTTTAGG ATATACAAGG GTGTGATCCA AGCTATCCAG AAATCAGATG 300 

AAGGCCACCC ATTCAGGGCA TATCTGGAAT CTGAAGTTGC TATATCTGAG GAGTTGGTTC 360 

AGAAGTACAG TAATTCTGCT CTTGGTCATG TGAACTGCAC GATAAAGGAA CTCAGGCGCC 420 

TCTTCTTAGT TGATGATTTA GTTGATTCTC TGAAGTTTGC AGTGTTGATG TGGGTATTTA 480 

CCTATGTTGG TGCCTTGTTT AATGGTCTGA CACTACTGAT TTTGGCTCTC ATTTCACTCT 540 

TCAGTGTTCC TGTTATTTAT GAACGGCATC AGGCACAGAT AGATCATTAT CTAGGACTTG 600 

CAAATAAGAA TGTTAAAGAT GCTATGGCTA AAATCCAAGC AAAAATCCCT GGATTGAAGC 660 

GCAAAGCTGA ATGAAAACGC CCAAAATAAT TAGTAGGAGT TCATCTTTAA AGGGGATATT 720 

CATTTGATTA TACGGGGGAG GGTCAGGGAA GAACGACCTT GACGTTGCAG TGCAGTTTCA 780 

CAGATCGTTG TTAGATCTT 799 



(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: THP1NOB01 

(B) CLONE: 31870 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Met 


Ala 


Glu 


Arg 


Xaa 


Ala 


Ala 


Thr 


1 






5 








Ser 


Phe 


Gly 


Ala 


Glu 


Pro 


Ser 


Ala 








20 










Ala 


Cys 


Pro 


Ala 


Leu 


Gly 


Thr 


Lys 






35 










40 


His 


Asp 


Leu 


He 


Xaa 


Trp 


Arg 


Asp 




50 










55 




Gly 


Thr 


Thr 


Leu 


He 


Met 


Leu 


Leu 


65 










70 






Ser 


Val 


Val 


Ser 


Tyr 


Leu 


He 


Leu 










85 








Phe 


Arg 


He 


Tyr 


Lys 


Ser 


Val 


He 








100 










Gly 


His 


Pro 


Phe 


Lys 


Ala 


Tyr 


Leu 






115 










120 


Glu 


Ala 


Phe 


His 


Asn 


Tyr 


Met 


Asn 




130 








135 




Ala 


Leu 


Lys 


Leu 


lie 


He 


Arg 


Leu 


145 










150 






Ser 


Leu 


Lys 


Leu 


Ala 


Val 


Phe 


Met 










165 








Val 


Phe 


Asn 


Gly 


He 


Thr 


Leu 


Leu 








180 










Ser 


Val 


Pro 


He 


Val 


Tyr 


Xaa 


Lys 






195 










200 


Trp 


Asn 


Arg 


Gin 


Lys 


Lys 


Gly 


Arg 




210 










215 




Gin 


Gin 


Leu 


Leu 


Lys 


His 


His 


Leu 


225 








230 






Leu 

















Gin 


Ser 


His 


Ser He 


Ser Ser Ser 




10 






15 


Pro 


Gly Gly 


Gly Gly Ser Pro Gly 


25 








30 


Ser 


Cys 


Ser 


Ser Ser 


Cys Ala Val 








45 




Val 


Lys 


Lys 


Thr Gly 


Phe Val Phe 








60 




Ser 


Leu 


Ala 


Ala Phe 


Ser Val He 






75 




60 


Ala 


Leu 


Leu 


Ser Val 


Thr He Ser 




90 






95 


Gin 


Ala 


Val 


Gin Lys 


Ser Glu Glu 


105 








110 


Asp 


Val Asp 


He Thr 


Leu Ser Ser 








125 




Ala 


Ala 


Met 


Val His 


He Asn Arg 








140 


Phe 


Leu 


Val 


Glu Asp Leu Val Asp 






155 




160 


Trp 


Leu 


Met 


Thr Tyr Val Gly Ala 




170 






175 


He 


Leu 


Ala 


Glu Leu 


Leu He Xaa 


185 








190 


Tyr 


Lys 


Val 


Pro Ser Lys Thr Pro 








205 




He 


Ser 


Thr 


Trp Lys 


Pro Glu Met 








220 




He 


Val 


He 


Thr Ser 


Leu Leu Val 






235 




240 



(2) INFORMATION FOR SEQ ID NO: 4: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: THP1NOB01 

(B) CLONE: 31870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

ACACNAGCGN NTCGNGCTCC CGAACCTCTA GCTGCGACTC GGANTGAGTC AGTCAGTCTG 60 

TCGGAGTCTG TCCTCGGAGC AGGCGGAGTA AAGGGACTTG AGCGAGCCAG TTGCCGGATT 120 

ATTCTATTTC CCCTCCCTCT CTCCCGCCCC GTATCTCTTT TCATTTTNNT NCCACCCTTG 180 

CTCGCGTANC ATGGCGGAGC GTNCGGCGGC CACTCAGTCC CATTCCATCT CCTCGTCGTC 240 

CTTCGGAGCC GAGCCGTCCG CGCCCGGCGG CGGCGGGAGC CCAGGAGCCT GCCCCGCCCT 300 

GGGGACGAAG AGCTGCAGCT CCTCCTGTGC GGTGCACGAT CTGATTTTMT GGAGAGATGT 360 

GAAGAAGACT GGGTTTGTCT TTGGCACCAC GCTGATCATG CTGCTTTCCC TGGCAGCTTT 420 

CAGTGTCATC AGTGTGGTTT CTTACCTCAT CCTGGCTCTT CTCTCTGTCA CCATCAGCTT 480 

CAGGATCTAC AAGTCCGTCA TCCAAGCTGT ACAGAAGTCA GAAGAAGGCC ATCCATTCAA 540 

AGCCTACCTG GACGTAGACA TTACTCTGTC CTCAGAAGCT TTCCATAATT ACATGAATGC 600 

TGCCATGGTG CACATCAACA GGGCCCTGAA ACTCATTATT CGTCTCTTTC TGGTAGAAGA 660 

TCTGGTTGAC TCCTTGAAGC TGGCTGTCTT CATGTGGCTG ATGACCTATG TTGGTGCTGT 720 

TTTTAACGGA ATCACCCTTC TAATTCTTGC TGAACTGCTC ATTTTNAGTG TCCCGATTGT 780 

NTATNAGAAG TACAAGGTTC CAAGCAAAAC TCCCTGGAAT CGCCAAAAAA AAGGCAGAAT 840 

AAGTACATGG AAACCAGAAA TGCAACAGTT ACTAAAACAC CATTTAATAG TTATAACGTC 900 

GTTACTTGTA CTATGAAGGA AAATACTCAG TGTCAGCTTG AGCCTGCATT CCAAGCTTTT 960 

TTTTTAATTT GGTGGTTTTC TCCCATCCTT TCCCTTTAAC CCTCAGTNTC AAGCACAAAN 1020 

TTTNATGGAC TGATAANNGA TCTATNTTAG ANCTCAGAAG ANGANAGNTT CANNTGCATA 1080 

GGNTAAGGNA NTACC 1095 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 776 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank. 

(B) CLONE: 307307 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



Met 


Ala 


Ala 


Pro Gly 


Asp Pro 


Gin 


Asp 


Glu Leu Leu Pro Leu Ala Gly 
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5 








10 15 


Pro 


Gly Ser 


Gin Trp 


Leu Arg 


His 
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Gly Glu Gly Glu Asn Glu Ala 








20 






25 


30 


Val 


Thr 


Pro 


Lys Gly 


Ala Thr 


Pro 


Ala 


Pro Gin Ala Gly Glu Pro Ser 






35 






40 




45 


Pro 


Gly 


Leu 


Gly Ala 


Arg Ala Arg 


Glu 


Ala Ala Ser Arg Glu Ala Gly 




50 






55 






60 


Ser 


Gly 


Pro 


Ala Arg 


Gin Ser 


Pro 


Val 


Ala Met Glu Thr Ala Ser Thr 


65 






70 






75 80 


Gly 


Val 


Ala 


Gly Val 


Ser Ser 


Ala 


Met 


Asp His Thr Phe Ser Thr Thr 



85 90 95 
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Ser 


Lys 


Asp 


Glv 








100 


Cys Tyr 


Pro 


Pro 






115 




Lys Glu 


Asn 


Glv 




130 






Thr 


Pro 


Glv 


Pro 


145 








Leu 


Phe 


Ser 


Ser 


Glu 


Val 


Asn 


Lys 










Ala 


Tyr 


Lys 


Tyr 






1 QC. 
i7J 




Glu 


Gin 


HIS 


His 




210 






Lys Asp 


1 ill 


Asp 


225 








Lys 


Pro 


Ala 


Pro 


Glu 


Ser 


Thr 


Phe 








260 


Arg Ala 


Pro 


Gin 






275 




Glu 


Pro 


Ser 


Val 




290 






Asp 


He 


Cys 


Leu 


305 








Ser 


Glu 


Pro 


Glu 


Gly Thr 


Glu 


Pro 








340 


Glu Asp 


Glu 


Leu 






ice 

355 




Glu 


Thr 


Ala 


GlU 




370 






Glu 


Val 


Lys 


Ala 


385 






His 


Glu 


Ala 


Ser 


Ser 


Glu 


Asp 












Ser 


Phe 


Gly 


His 






4 IS 




He 


Gin 


Tx/r 
1 J c 






450 






Glu 


Leu 


He 


Tic* 

IXC 


465 








Pro Lys 


Arg 


Glu 


Ala 


lie 


Arg 


Glu 








500 


Arg Arg 


Gly 


Leu 






515 




Glu 


Pro 


Gin 


Pro 




530 






Pro 


Glu 


Thr 


Pro 


545 








Asn 


Gin 


Ser 


Pro 


Ala 


Pro 


Pro 


Pro 








580 



Glu 


Gly 


Ser 


Cys 


Gin 


Glu 


Asp 


Ser 








120 


His 


Val 


Thr 


lie 






135 




Ser 


Leu 


Pro 


Asp 




150 






Asp 


Ser 


Glv 


He 










He 


Leu 


Ala 


Asp 


lie 


Asp 


Tie 
116 


i nr 








t uu 


Pro 
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Leu 


p i ti 

blU 
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C A V 


Tin 

x le 


Lys 
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PI II 
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Lys 


O jic 

z4b 








Ala 


Pro 


Tyr 


He 


He 


Thr 


Thr 


Pro 








"> O A 


Glu 


Thr 


Tnr 


Thr 










Lys 


fro 


oer 


Pro 




310 






Asp 


Asp 


Ser 


Pro 


T o c 








Ser 


Ala 


Ala 


Glu 


T 1 n. 

lie 


Tnr 


Ala 


T "1 a. 

I xe 










Asn 


Pro 


Arg 


Pro 






J / D 




Arg 


Ser 


uiy 


Pro 










Cor 


Mia 


ulu 


Ser 










Met 


Ala 


Ala 


Glu 


VOX 


vjiy 


ux y 










440 


He 


Leu 


Arg 


Glu 






455 




Glu 


Ser 




Asp 




470 






Gin 


Asp 


Ser 


Pro 


485 








Glu 


Thr 


Glv 


Val 


Ala 


Glu 


Pro 


Gly 








520 


Gly 


Pro 


Glu 


Leu 






535 




Met 


Leu 


Pro 


Arg 




550 






Ala 


Ala 


Thr 


Lys 


565 






Leu 


Leu 


Phe 


Leu 



Tyr 


Thr Ser 


Leu 


105 






Thr 


Tyr Phe 


Thr 


Ser 


Glu Ser 


Pro 






140 


Val 


Pro Gly 


He 




155 




Glu 


Met Thr 


Pro 




170 




Pro 


Leu Asp 


Gin 


185 






Arg 


Pro Glu 


Glu 


Asp 


Lys Asp 


Leu 










nit* m m 
uiu biy 


Vdi 








T 1 a 

xxe 


He Lys 


Asp 








Asp 


Asp Leu 


Ser 








Val 


Lys He 


Thr 


bin 


Glu Lys 


Tnr 




.JUU 


ASp 


Tnr val 


Pro 




JlD 




Pt w 

vjxy 


Cav Tin 

oer xxe 


inr 




J JU 




ber 


bin Gly 


Lys 


A 






Lys 


Glu Ala 


Lys 


vax 


PI \r rin 

biy bin 


Leu 






iq n 


Pro 


inr xxe 


rio 








oiy 


Asp Ser 


uiu 








Asp 


Al A T.OH 
rum 


Pro 


425 






Pro 


Pro Ser 


Pro 


Glu 


Arg Glu 


Ala 






460 


Ala 


Ser Ser 


Ala 




475 




Pro 


Met Lys 


Pro 




490 




Arg 


Ala Glu 


Glu 


505 






Ser 


Phe Leu 


Asp 


Pro 


Pro Gly 


Asp 






540 


Lys 


Pro Glu 


Glu 




555 




Gly 


Pro Gly 


Pro 




570 




Asn 


Lys Gin 


Lys 


585 
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He 


Ser 


Asp 


He 




110 






Gly 


He 


Leu 


Gin 


125 








Glu 


Glu 


Leu 


Gly 


Glu 


Ser 


Arg 


Gly 








160 


Ala 


Glu" 


Ser 


Thr 






175 




Met 


Lys 


Ala 


Glu 




190 






Val 


Lys 


Hie 

nib 


bin 


205 








Asp 


Phe 


Lys 


Asn 


Arg Glu 


Pro 


ASp 








OAf\ 
Z4U 


His 


Leu 


Leu 


blU 










Glu 


Glu 


bin 


Arg 




270 






Leu 


Thr 


blU 


l le 


285 








Pro 


Glu 


Lys 


Gin 


Thr 


Val 


i nr 


vai 










Pro 


Pro 


O A V 

ser 


Ser 
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Ser 
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350 
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Tyr 


365 








Ala 


Asp 


Arg 


Pro 


Ser 


Pro 


Leu 


Asp 








d nn 


He 


Glu 


Leu 


Ua 1 
Vol 






41 S 




Ser 


Gly 




Val 




430 






Ala 


Ser 


Pro 


Ser 


445 








Glu 


Leu 


Asp 


Ser 


Ser 


Glu 


Glu 


Ser 








480 


Ser 


Ala 


Leu 


Asn 






4 95 




Arg Ala 


Pro 


Ser 




510 






Tyr 


Pro 


Ser 


Thr 


525 








Gly Ala 


Leu 


Glu 


Asp 


Ser 


Ser 


Ser 








560 


Leu Gly 


Pro 


Gly 






575 




Ala 


He 


Asp 


Leu 




590 





SUBSTITUTE SHEET (RULE 26) 



WO 98/06841 



PCT/US97/13469 



Leu Tyr Trp Arg 


Asp 


Tin 

ilc 


Lys 












DUU 


Leu Leu Leu Leu 


rue 


C&v 


Leu 


T)-t r 


610 






DID 




Ala Tyr Leu Ala 


Leu 


Ala 


Ala 


Leu 


625 




630 






Tyr Lys Ser Val 


Leu 
645 


Gin 


Ala 


W-» 1 


Phe Lys Ala Tyr 


Leu 


Glu 


Leu 


blU 


660 










Gin Lys Tyr Thr 


Asp 


Cys 


Leu 


Gin 


675 








6B0 


Glu Leu Arg Arg 


Leu 


Phe 


Leu 


Val 


690 






695 




Phe Ala Val Leu 


Met 


Trp Leu 


Leu 


705 




710 






Gly Leu Thr Leu 


Leu 


Leu 


Met 


Ala 


725 








Val Val Tyr Val 


Lys 


His 


Gin 


Ala 


740 










Val Arg Thr His 


He 


Asn 


Ala 


Val 


755 








760 


Pro Gly Ala Lys 


Arg 


His 


Ala 


Glu 


770 






775 





Thr Gly 


He 


Val 


Phe 


Gly 


Ser 


Phe 










605 








Gin 


Phe 


Ser 


Val 
620 


Val 


JCl 


Val 


Val 


Ser 


Ala 


Thr 
635 


He 


Ser 


rne 


Arg 


Tie 
lie 

OflU 


Gin 


Lys 


Thr Asp Glu 


Gly 


His 


Pro 




650 










CAR 




He 


Thr 


Leu 


Ser 


Gin 


uiu 


Gin 


He 


665 










D /U 






Phe 


Tyr 


Val 


Asn 


Ser 
685 


Thr 


Leu 


Lys 


Gin Asp 


Leu 


Val 


Asp 


Ser 


Leu 


Lys 








700 










Thr 


Tyr 


Val 


Gly Ala 


Leu 


Phe 


Asn 




715 










720 


Val 


Val 
730 


Ser 


Met 


Phe 


Thr 


Leu 
735 


Pro 


Gin 


He 


Asp 


Gin 


Tyr 


Leu 


Gly Leu 


745 










750 






Val 


Ala 


Lys 


He 


Gin 
765 


Ala 


Lys 


He 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank 

(B) CLONE: 307309 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Ala Glu Asp Ala Leu Pro Ser Gly Tyr Val Ser Phe Gly His 

15 10 15 

Val Gly Gly Pro Pro Pro Ser Pro Ala Ser Pro Ser He Gin Tyr Ser 

20 25 30 

He Leu Arg Glu Glu Arg Glu Ala Glu Leu Asp Ser Glu Leu He He 

35 40 45 

Glu Ser Cys Asp Ala Ser Ser Ala Ser Glu Glu Ser Pro Lys Arg Glu 

50 55 60 

Gin Asp Ser Pro Pro Met Lys Pro Ser Ala Leu Asp Ala He Arg Glu 
65 70 75 80 

Glu Thr Gly Val Arg Ala Glu Glu Arg Ala Pro Ser Arg Arg Gly Leu 

85 90 95 

Ala Glu Pro Gly Ser Phe Leu Asp Tyr Pro Ser Thr Glu Pro Gin Pro 

100 105 110 

Gly Pro Glu Leu Pro Pro Gly Asp Gly Ala Leu Glu Pro Glu Thr Pro 

115 120 125 

Met Leu Pro Arg Lys Pro Glu Glu Asp Ser Ser Ser Asn Gin Ser Pro 

130 135 140 

Ala Ala Thr Lys Gly Pro Gly Pro Leu Gly Pro Gly Ala Pro Pro Pro 
145 150 155 160 

Leu Leu Phe Leu Asn Lys Gin Lys Ala He Asp Leu Leu Tyr Trp Arg 
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165 170 175 



Asp 


lie 


Lvs 


Gin 


Thr Gly 


He Val Phe 


Gly 


Ser. 


Phe 


Leu 


Leu 


Leu 


Leu 








180 






185 










190 






Phe 


Ser 


Leu 
195 


Thr 


Gin 


Phe 


Ser Val Val 
200 


Ser 


Val 


Val 


Ala 
205 


Tvr 


Leu 


Ala 


Leu 


Ala 


Ala 


Leu. 


Ser 


Ala 


Thr He Ser 


Phe 


Arg 


lie Tyr 


Lys 


Ser 


Val 




210 










215 






220 










Leu 


Gin 


Ala 


Val 


Gin 


Lys 


Thr Asn Glu 


Gly His 


Pro 


Phe 


Lys 


Ala 


Tvr 


225 










230 






235 










240 


Leu 


Glu 


Leu 


Glu 


He 
245 


Thr 


Leu Ser Gin 


Glu 
250 


Gin 


He 


Gin 


Lys 


Tyr 
255 


Thr 


Asp Cys 


Leu 


Gin 


Phe 


Tyr 


Val Asn Ser 


Thr 


Leu 


Lys 


Glu 


Leu 


Arg 


Arg 
























C, f V 






Leu 


Phe 


Leu 


Val 


Gin Asp 


Leu Val Asp 


Ser 


Leu 


Lys 


Phe 


Ala 


Val 


Leu 






£. 1 D 
















265 








Met 


Trp 


Leu 


Leu 


Thr 


Tyr 


Val Gly Ala 


Leu 


Phe Asn Gly 


Leu 


Thr 


Leu 




290 










295 






300 










Leu 


Leu 


Met 


Ala 


Val 


Val 


Ser Met Phe 


Thr 


Leu 


Pro 


Val 


Val 


Tyr 


Val 


305 










310 






315 








320 


Lys 


His 


Gin 


Ala 


Gin 
325 


He 


Asp Gin Tyr 


Leu 
330 


Gly 


Leu 


Val 


Arg 


Thr 
335 


His 


lie 


Asn 


Ala 


Val 
340 


Val 


Ala 


Lys He Gin 
34 5 


Ala 


Lys 


He 


Pro 


Gly 
350 


Ala 


Lys 


Arg 


His 


Ala 
355 


Glu 























(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 
(A) LIBRARY: GenBank 
EB) CLONE: 307311 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Met 


Gin 


Ala 


Thr 


Ala 


Asp 


Ser 


Thr 


Lys 


Met 


Asp 


Cys Val 


Trp 


Ser 


Asn 


1 








5 










10 








15 




Trp Lys 


Ser 


Gin 


Ala 


He 


Asp 


Leu 


Leu 


Tyr 


Trp 


Arg Asp 


He 


Lys 


Gin 








20 










25 








30 






Thr 


Gly 


He 
35 


Val 


Phe 


Gly 


Ser 


Phe 
40 


Leu 


Leu 


Leu 


Leu Phe 
45 


Ser 


Leu 


Thr 


Gin 


Phe 


Ser 


Val 


Val 


Ser 


Val 


Val 


Ala 


Tyr 


Leu 


Ala Leu 


Ala 


Ala 


Leu 




50 










55 








60 








Ser 
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He 


Ser 


Phe 


Arg 


He 


Tyr 


Lys 


Ser 


Val Leu 


Gin 


Ala 


Val 


65 










70 






75 








80 


Gin 


Lys 


Thr 


Asp 


Glu 
85 


Gly 


His 


Pro 


Phe 


Lys 
90 


Ala 


Tyr Leu 


Glu 


Leu 
95 


Glu 


He 


Thr 


Leu 


Ser 
100 


Gin 


Glu 


Gin 


He 


Gin 
105 


Lys 


Tyr 


Thr Asp 


Cys 
110 


Leu 


Gin 


Phe 


Tyr 


Val 


Asn 


Ser 


Thr 


Leu 


Lys 


Glu 


Leu 


Arg 


Arg Leu 


Phe 


Leu 


Val 






115 










120 






125 








Gin 


Asp 


Leu 


Val 


Asp 


Ser 


Leu 


Lys 


Phe 


Ala 


Val 


Leu Met 


Trp 


Leu 


Leu 




130 










135 










140 






Thr 


Tyr 


Val 


Gly 


Ala 


Leu 


Phe 


Asn 


Gly 


Leu 


Thr 


Leu Leu 


Leu 


Met 


Ala 


145 










150 










155 








160 


Val 


Val 


Ser 


Met 


Phe 


Thr 


Leu 


Pro 


Val 


Val 


Tyr 


Val Lys 


His 


Gin 


Ala 
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165 170 175 

Gin lie Asp Gin Tyr Leu Gly Leu Val Arg Thr His lie Asn Ala Val 

1B0 185 190 

Val Ala Lys lie Gin Ala Lys He Pro Gly Ala Lys Arg His Ala Glu 

195 200 205 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: GenBank, 

(B) CLONE: 281046 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Asp Cys Val Trp Ser Asn Trp Lys Ser Gin Ala He Asp Leu Leu 

15 10 15 

Tyr Trp Arg Asp He Lys Gin Thr Gly He Val Phe Gly Ser Phe Leu 

20 25 30 

Leu Leu Leu Phe Ser Leu Thr Gin Phe Ser Val Val Ser Val Val Ala 

35 40 45 

Tyr Leu Ala Leu Ala Ala Leu Ser Ala Thr He Ser Phe Arg He Tyr 

50 55 60 

Lys Ser Val Leu Gin Ala Val Gin Lys Thr Asp Glu Gly His Pro Phe 
65 70 75 80 

Lys Ala Tyr Leu Glu Leu Glu He Thr Leu Ser Gin Glu Gin He Gin 

85 90 95 

Lys Tyr Thr Asp Cys Leu Gin Leu Tyr Val Asn Ser Thr Leu Lys Glu 

100 105 HO 

Leu Arg Arg Leu Phe Leu Val Gin Asp Leu Val Asp Ser Leu Lys Phe 

115 120 125 

Ala Val Leu Met Trp Leu Leu Thr Tyr Val Gly Ala Leu Phe Asn Gly 

130 135 140 

Leu Thr Leu Leu Leu Met Ala Val Val Ser Met Phe Thr Leu Pro Val 
145 150 155 160 

Val Tyr Val Lys His Gin Ala Gin Val Asp Gin Tyr Leu Gly Leu Val 

165 170 175 

Arg Thr His He Asn Thr Val Val Ala Lys He Gin Ala Lys He Pro 

160 185 190 

Gly Ala Arg Gly Met Leu Ser Arg Trp Leu Pro Gin Glu Lys Pro Asp 

195 200 205 

Met Asn Gly Gly Val Trp Ser Gly Asn Ser Ser Leu Leu Pro Arg Tyr 

210 215 220 

Cys Glu Leu He Val Ser Leu Pro Gin Tyr His Asn Leu Arg Gly Lys 
225 230 235 240 

Leu Arg Asp Arg Cys Phe Gin Ser Phe Pro Val Leu Leu Gly Tyr Leu 

245 250 255 

Ser Pro Pro Arg Pro Leu Ser Ser Thr Lys Val 
260 265 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: SPLNFET01 

(B) CLONE: 28742 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

• 

CCTATNCCNG CTGCTTTCAT TGACAGTATT CAGCATTGTG AGCGTAACAG CCTACATTGC 60 

CTTNGCCCTG CNCTCTGTGA CCATCAGCTN TAGGCTATAC AAGGGTGTGA TCCAAGCTAT 120 

CCAGAAATCA GATGAAGGNC ACCCATTCAG GGCATATCTG GANTCTGAAG TTGCTATATC 180 

TGAGGAGTTG NTTCAGAAGT ACACGTAAAT NNTGNNCNTG GTCAATGTGA NCTCCACGNC 240 

TAANGGANCT CAGGTGCCTA T 261 
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CLAIMS 

1. A substantially purified human NSPLP protein comprising the 
amino acid sequence of SEQ ID KO:l or fragments thereof. 

2. An isolated and purified polynucleotide sequence encoding a 
protein of claim 1. 

3. An isolated and purified polynucleotide sequence of claim 2 
consisting of the sequence of SEQ ID NO: 2 or degenerate variants thereof. 

4. A polynucleotide sequence fully complementary to the sequence 
of SEQ ID NO: 2 or degenerate variants thereof. 

5. An isolated and purified polynucleotide sequence of claim 2 
consisting of a polynucleotide sequence that hybridizes under stringent 
hybridization conditions to the sequence of SEQ ID NO:2. 

6. A recombinant expression vector containing a polynucleotide 
sequence of claim 2. 

7. A recombinant host cell comprising a polynucleotide sequence of 

claim 2. 

8. A method for producing a polypeptide comprising the amino acid 
sequence shown in SEQ ID NO:l, the method comprising the steps of: 

a) culturing the host cell of Claim 7 under conditions 
suitable for the expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture . 

9. A recombinant expression vector containing a polynucleotide 
sequence of claim 5. 

10. A recombinant host cell comprising a polynucleotide sequence 

of claim 9. 

11. A pharmaceutical composition comprising a substantially 
purified human NSPLP protein (SEQ ID NO:l) in conjunction with a suitable 
pharmaceutical carrier . 

12. A purified antibody which binds specifically to a polypeptide 

of claim 1. 

13. A purified antagonist which specifically blocks or reduces the 
activity of the polypeptide of claim 1. 

14. A pharmaceutical composition comprising a substantially 
purified antagonist of the polypeptide of claim 1 in conjunction with a 
suitable pharmaceutical carrier. 

15. A substantially purified human NSPLP protein comprising the 
amino acid sequence of SEQ ID NO: 3 or fragments thereof. 

16. An isolated and purified polynucleotide sequence encoding a 
protein of claim 15. 

17. An isolated and purified polynucleotide sequence of claim 16 
consisting of the sequence of SEQ ID NO: 4 or degenerate variants thereof. 

18. A polynucleotide sequence fully complementary to the sequence 
of SEQ ID NO: 4 or degenerate variants thereof. 
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19. An isolated and purified polynucleotide sequence of claim 16 
consisting of a polynucleotide sequence that hybridizes under stringent 
hybridization conditions to the sequence of SEQ ID NO: 4. 

20. A recombinant expression vector containing a polynucleotide 
sequence of claim 16. 

21. A recombinant host cell comprising a polynucleotide sequence 
of claim 16 # . 

22. A method for producing a polypeptide comprising the amino acid 
sequence shown in SEQ ID NO: 3, the method comprising the steps of: 

a) culturing the host cell of Claim 21 under conditions 
suitable for the expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

23. A recombinant expression vector containing a polynucleotide 
sequence of claim 19. 

24. A recombinant host cell comprising a polynucleotide sequence 
of claim 23. 

25. A pharmaceutical composition comprising a substantially 
purified human NSPLP protein (SEQ ID NO: 3) in conjunction with a suitable 
pharmaceutical carrier. 

26. A purified antibody which binds specifically to a polypeptide 
of claim 15. 

27. A purified antagonist which specifically blocks or reduces the 
activity of the polypeptide of claim 15. 

28. A pharmaceutical composition comprising a substantially 
purified antagonist of the polypeptide of claim 15 in conjunction with a 
suitable pharmaceutical carrier. 
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